Bell Laboratories Fiftieth Anniversary


Since its formation in January 1925, Bell Laboratories has taken 
over the central role of scientific research and technical development 
from its predecessors in the Bell System. The pages of The Bell System 
Technical Journal have reflected a steady flow of Bell Laboratories 
technological contributions over the intervening years (see “Fifty 
Years of B.S.T.J.” in the July-August 1972 issue). It is, therefore, par- 
ticularly fitting that we give some recognition to this half-century 
anniversary of Bell Laboratories. 

With its preeminent contribution to such broad fields as information 
theory, solid-state technology, and electronic data processing, Bell 
Laboratories has been the creative force in a vital and closely integrated 
Bell System team for transforming technical possibilities into economi- 
cal telecommunications realities. Examples include the development 
and deployment of today’s vast microwave carrier network (now con- 
taining over 300 million circuit miles), the burgeoning array of elec- 
tronic switching systems—providing a new degree of flexibility and 
reliability in the handling of telephone calls—and the provision and use 
of an increasing family of versatile, sophisticated, computer-based 
systems for better network implementation and maintenance as well 
as for more efficient business operations. 

In an era when the Bell System is being attacked for its monolithic 
structure, while at the same time being generally praised for the quality 
of its service and its record in keeping costs down, we should carefully 
note that our spectacular progress is largely a result of the close ties 
among Bell Laboratories, Western Electric, the Operating Telephone 
Companies, and the AT&T parent company, and is not an independent 
phenomenon. 


The second half-century of Bell Laboratories scientific and engineer- 
ing activity is beginning with the same exciting promise as did the 
first. Innovative research in basic materials and processes flourishes. 
At the system level, trends in both switching and transmission are 
pointing to a steadily growing digital network with important ad- 
vantages in quality and flexibility of service. Giant projects such as 
toll ESS switching and millimeter waveguide transmission are harbingers
of this future. Optical fiber transmission will form the basis for
another avenue of important telecommunications advances. Low-cost 
electronics are extending the fruits of the solid-state revolution to 
the loop plant. The realization of control and processing functions in 
ever smaller, cheaper electronic packages opens the door to increased 
sophistication in system operation while maintaining uncomplicated 
interfaces with those who use and maintain such systems. An integrated 
Bell System team is the key to successful transformation of such 
technological promises into telecommunications service to the nation 
and, less directly, to the world. 


W. E. Danielson 
Chairman, 
B.S.T.J. Editorial Committee 
Mode Coupling in an Optical Fiber With
Core Distortions 


By D. MARCUSE and H. M. PRESBY 
(Manuscript received May 17, 1974) 


The variations in the geometry of a step-index optical fiber are deter- 
mined as functions of position along the axis of the fiber by an analysis of 
the backscattered light produced when a beam from a cw laser is incident
perpendicular to the fiber axis. The power spectrum computed from this 
distortion function is then utilized with coupled-mode theory to predict 
the mode coupling, the reduction in pulse dispersion, and the accompany- 
ing increased radiation loss of the fiber. The theoretical calculations support 
experimental observations and account for a partial reduction in the multimode
pulse dispersion.


I. INTRODUCTION 


Optical-fiber communication systems utilizing incoherent light 
sources such as light-emitting diodes require the use of multimode 
waveguides to insure efficient excitation of the guide. Such guides, 
however, suffer from multimode pulse dispersion, because modes with 
higher group velocity arrive at the receiver earlier than modes with 
lower group velocity, limiting the information-carrying capacity of the 
fiber. 

Thus, in a fiber of length L, uniform core of index $n_1$, uniform cladding
of index $n_1(1 - \Delta)$, and constant cross section, a short pulse feeding
equal amounts of power to every mode at the input will arrive at the
other end with a width $\tau = n_1 \Delta L/c$, where c is the speed of light in free
space.¹ For example, a fiber with $\Delta = 0.01$ and $n_1 = 1.5$ will have a
delay spread of $\tau/L = 50$ ns/km, a serious limitation on either high-capacity
or long-distance transmission.
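As a quick numerical check of the delay-spread estimate above, the following Python sketch evaluates $\tau/L = n_1 \Delta/c$. The function name and the value used for c are illustrative choices and do not come from the paper.

```python
# Illustrative check of the uncoupled multimode delay spread tau/L = n1 * Delta / c.
C_KM_PER_S = 299_792.458  # speed of light in free space, km/s


def delay_spread_ns_per_km(n1: float, delta: float) -> float:
    """Return the uncoupled delay spread per unit length in ns/km."""
    return n1 * delta / C_KM_PER_S * 1e9


if __name__ == "__main__":
    # Values quoted in the text: n1 = 1.5, Delta = 0.01 (about 50 ns/km).
    print(round(delay_spread_ns_per_km(1.5, 0.01), 1))
```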

Multimode pulse distortion can be reduced by introducing coupling 
between the guided modes.² The reduction in the pulse length comes
about because some power traveling in a fast mode is eventually trans- 
ferred to a slow mode, while power starting out in a slow mode finds 
itself at least partially in a fast mode, so that the extremes of the
group velocity spread are partly equalized. The root-mean-square 
delay spread is proportional to the square root of the product of fiber
length L and the coupling length $L_c$ associated with steady-state power
transfer.? The multimode delay spread $\tau$ is thus reduced by a factor of
$(L_c/L)^{1/2}$ which, in the case of strong coupling, $L_c \ll L$, can be significant.

One coupling mechanism that has been investigated in detail is 
geometric variations of the fiber along the longitudinal direction z.? If
the deformation function f(z), which is defined by these variations, is
expanded in a Fourier series, two guided modes $\mu$ and $\nu$ with propagation
constants $\beta_\mu$ and $\beta_\nu$ will be coupled by the Fourier component
whose spatial frequency is given by? $\Omega = \beta_\mu - \beta_\nu$. In other words, the
spatial period is the beat wavelength between the $\mu$th and $\nu$th modes.
Using $n_1 = 1.5$ and $\Delta = 0.01$ as before and assuming a signal wavelength
of 1.0 μm and a fiber core 50 μm wide, the beat wavelength for
the adjacent lowest-order modes is 10 mm and for the adjacent modes
near cut-off is 0.7 mm. Spatial periods greater than 10 mm will have 
very little effect on mode mixing and spatial periods less than 0.7 mm 
will create signal loss by coupling guided modes to the radiation field. 
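The two beat wavelengths quoted above can be reproduced approximately with a short Python sketch. It rests on assumptions that are not stated at this point in the text: modes are labeled by a single compound number M with transverse constant $\kappa = M\pi/(2b)$ (the crude approximation introduced later in the analysis), the paraxial form $\beta \approx n_1 k - \kappa^2/(2 n_1 k)$ is used, and cut-off is estimated from $\kappa_{\max} = n_1 k \sqrt{2\Delta}$. All function names are hypothetical.

```python
import math


def beta(M, n1, b, wavelength):
    """Approximate axial propagation constant for compound mode number M.

    Assumes kappa = M*pi/(2b) and beta ~ n1*k - kappa**2 / (2*n1*k).
    """
    k = 2 * math.pi / wavelength
    kappa = M * math.pi / (2 * b)
    return n1 * k - kappa**2 / (2 * n1 * k)


def beat_wavelength(M, n1, b, wavelength):
    """Beat wavelength 2*pi/(beta_M - beta_{M+1}) between neighboring mode groups."""
    return 2 * math.pi / (beta(M, n1, b, wavelength) - beta(M + 1, n1, b, wavelength))


def cutoff_mode_number(n1, delta, b, wavelength):
    """Largest M with kappa below n1*k*sqrt(2*delta) (weak-guidance estimate)."""
    k = 2 * math.pi / wavelength
    return int(n1 * k * math.sqrt(2 * delta) * 2 * b / math.pi)


# Values from the text: n1 = 1.5, Delta = 0.01, 1.0-um wavelength, 50-um core (b = 25 um).
n1, delta, b, lam = 1.5, 0.01, 25e-6, 1.0e-6
M_c = cutoff_mode_number(n1, delta, b, lam)
low = beat_wavelength(1, n1, b, lam)         # lowest-order neighbors
high = beat_wavelength(M_c - 1, n1, b, lam)  # neighbors just below cut-off
print(f"{low * 1e3:.1f} mm, {high * 1e3:.2f} mm")  # roughly 10 mm and 0.7 mm
```

Under these assumptions the lowest-order spacing comes out at 10 mm and the spacing near cut-off at a little over 0.7 mm, in line with the figures quoted in the text.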

In this paper, the observation is reported of interface irregularities 
“unintentionally” introduced into a fiber during the pulling process. 
Distortions that are on the order of several microns are detected and 
measured by a backscattered light analysis technique. With the dis- 
tortion function in hand, the power spectrum is computed and utilized 
to predict mode coupling, reduction in pulse dispersion, and accom- 
panying increased radiation loss of the fiber. This represents the first 
time that the distortion function of a real fiber has actually been mea- 
sured and utilized to predict transmission behavior. 


II. EXPERIMENT


The fiber studied was pulled from a preform produced by a chemical 
vapor deposition (CVD) process.⁴ The core of the fiber is elliptical, with
major and minor axes on the order of 50 μm and 30 μm, respectively.
The corresponding measurements for the outside fiber dimensions are
118 μm and 110 μm. A microinterferogram and a plot of the index profile
are shown in Fig. 1. The profile is close to that of a step-index fiber
with a very slight modification at the center. The maximum index
difference between the core and the cladding is about $n_1 - n_2 = 0.0135$
or $\Delta = 0.0093$. It is important to point out that the refractive index
profile did not vary by measurable amounts (less than one part in 10⁴)
over distances of interest for mode mixing in the fiber (0.2 cm to 2.0
cm), though it did vary from one end of the 1-km fiber to the other.

Fig. 1—Microinterferogram and index profile of CVD step-index fiber.

If the geometry of the fiber remained uniform along its length as in 
Fig. 1, the fiber would propagate a given set of modes that would not 
couple among each other, producing a pulse spread of approximately 
45 ns/km. The measured pulse width at the 10-dB point, however, was 
about 10 ns after nearly 1 km of fiber.⁵


To determine the core-cladding interface distortion, which we believe 
is at least partially responsible for this reduction in pulse dispersion, 
a backscattered light analysis technique was utilized.⁶ This method can
be used to detect the parameter b/a, where b is the radius of the fiber
core and a is the radius of the cladding, assuming constant indexes of
refraction for the core and the cladding.⁷ The technique has been extended
to make observations on extended lengths of fiber by the set-up
shown in Fig. 2. Light from a cw He-Ne laser is directed to oscillating
mirror M₂ by means of fixed mirror M₁. The oscillating mirror serves
to transform the ~1-mm circular beam into a line 1-mm wide, with 
length determined by the amplitude of oscillation. This line impinges 
upon the fiber, and the backscattered light is detected with photo- 
graphic film. 

A typical backscattered light distribution is shown in Fig. 3. Figure 3a 
is the overall pattern arising from a 12-cm length of fiber, and Fig. 3b 
is an expansion of the section on which measurements are made. The 
magnification in Fig. 3b is approximately 1 to 1. Figure 3a is symmetric 
about the midpoint with sharp cut-offs at the ends, typical of the back- 
scattered light distribution. As we go from the center of the pattern 
outward, we observe a region of enhanced fringe intensity. The loca- 
tion of the last fringe in this region is determined by the parameter 
b/a, assuming constant $n_1$ and $n_2$.⁷ Here we interpret b and a as an
average diameter. If the fiber were of uniform geometry along its 
length, the fringes would be straight parallel lines; the departure of 
this fringe from straightness gives a measure of the variation of b/a, 
and hence the distortion function of the fiber. 

Measurements were made on several 14-cm lengths of fiber taken
from both ends of the 1-km length. The appearance of the fringe distortion
was similar visually for other sections taken from intermediate
portions of the fiber, but detailed measurements were not made on
them. Figure 4 is a representative curve of the core distortion. The curve
shows $\Delta b = f(z)$ plotted versus position along the fiber axis, with
measurements made every 2 mm. The f(z) variations are on the order
of several percent.

Fig. 2—Optical arrangement to determine core distortion function.

Fig. 3—Backscattered light patterns. (a) Complete pattern from 12.0-cm section
of fiber. Slit in center allows for passage of incident light and bright spots in center
are photographic artifacts. (b) Expansion of section of interest.

We believe that the core-cladding interface distortions observed 
were introduced into the fiber during the pulling process. The preform 
was reduced to fiber form in an oxyhydrogen flame-pulling apparatus. 
Either the instability of the flame as a heat source or slight movements
of the fiber because of air currents and back-flame effects or combina- 
tions of these are probably the contributing factors. Nonuniform pull- 
ing speeds and varying preform geometry may be additional factors, 
but they are expected to introduce variations with a much longer 
periodicity. 


III. ANALYSIS


Fig. 4—Representative core distortion function.

Given the core distortion function, the Fourier transform is then
obtained by computer. A typical transform plot is shown as a bar
graph in Fig. 5. We see that significant components exist in the coupling
regions of interest. It is our aim to relate this spatial Fourier spectrum 
of the distortion function f(z) to the pulse width reduction caused by 
mode coupling. The theory is described in detail in Ref. 8 for a fiber 
whose core maintains its circular cross section but is randomly bent. 
The core of the fiber examined in this paper has an elliptical cross sec- 
tion, and the observed variations consist of changes in the ellipticity. 
Modes of the elliptical fiber are very complicated and are difficult to 
apply to a mode-mixing analysis. For this reason, we simplify the prob- 
lem by assuming that the fiber core nominally has a circular cross sec- 
tion that deforms itself randomly into an ellipse. It is assumed that 
this model is capable of yielding order-of-magnitude estimates of the 
performance of the actual fiber. 
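The computation of the spatial Fourier spectrum from a sampled distortion function can be sketched as follows. This is only an illustration: the sample values are synthetic (a single hypothetical 1-μm ripple with a 16-mm period, sampled every 2 mm), the normalization by the square root of the record length is an assumption, and a plain discrete Fourier sum is used in place of whatever routine the authors ran.

```python
import cmath
import math


def power_spectrum(samples, dz):
    """Return (spatial_frequency, |F_j|^2) pairs for a uniformly sampled f(z).

    F_j is a discrete Fourier sum normalized by sqrt(record length);
    this normalization is an assumption, not taken from the paper.
    """
    n = len(samples)
    length = n * dz
    mean = sum(samples) / n
    spectrum = []
    for j in range(1, n // 2 + 1):
        omega = 2 * math.pi * j / length  # spatial frequency, rad per unit length
        acc = sum((s - mean) * cmath.exp(-1j * omega * i * dz)
                  for i, s in enumerate(samples))
        F = acc * dz / math.sqrt(length)
        spectrum.append((omega, abs(F) ** 2))
    return spectrum


# Hypothetical data: 64 samples taken every 2 mm, one ripple of 16-mm period.
dz = 2.0e-3
f = [1.0e-6 * math.sin(2 * math.pi * i * dz / 16.0e-3) for i in range(64)]
spec = power_spectrum(f, dz)
peak_omega, peak_power = max(spec, key=lambda p: p[1])
print(2 * math.pi / peak_omega)  # spatial period of the strongest component, in m
```

With 2-mm sampling the shortest resolvable spatial period is 4 mm, so a record of this kind probes only the long-period end of the coupling range discussed in the introduction.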

The core-cladding boundary r of a fiber with elliptical deformations 
can be described by the function® 


$$r = b + f(z)\cos 2\phi. \tag{1}$$

The constant radius of the perfect fiber is b, f(z) is the distortion function
shown in Fig. 4 whose Fourier power spectrum is shown by the
vertical bars in Fig. 5, and $\phi$ is the angle of the cylindrical coordinate
system. A fiber with the core-cladding deformation of (1) couples
modes according to the selection rule $\Delta\nu = \pm 2$. The label $\nu$, indicating
the azimuthal mode number, enters the field expressions’ via
$\cos \nu\phi$ or $\sin \nu\phi$.


Fig. 5—Fourier transform of core distortion function.

If the power spectrum of f(z) drops off sufficiently rapidly to limit
mode coupling to nearest neighbors, the coupled power equations 
reduce to the form" 


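$$\begin{aligned}
\frac{\partial P_{\nu,m}}{\partial z} ={}& h_{\nu+2,m;\,\nu,m}\,(P_{\nu+2,m} - P_{\nu,m}) - h_{\nu,m;\,\nu-2,m}\,(P_{\nu,m} - P_{\nu-2,m}) \\
&+ h_{\nu+2,m-1;\,\nu,m}\,(P_{\nu+2,m-1} - P_{\nu,m}) - h_{\nu,m;\,\nu-2,m+1}\,(P_{\nu,m} - P_{\nu-2,m+1}).
\end{aligned} \tag{2}$$

$P_{\nu,m}$ is the average power carried by the mode labeled $\nu, m$. In a crude
approximation (applicable for small $\nu$ values and modes far from cut-off),
the transverse propagation constant can be expressed as¹¹

$$\kappa_{\nu,m} = (\nu + 2m)\,\frac{\pi}{2b}. \tag{3}$$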


In addition to the already mentioned azimuthal mode number $\nu$, we
have also introduced the radial mode number m. Together, these two
parameters form the compound mode number

$$M = \nu + 2m. \tag{4}$$


The expression in the second line on the right-hand side of (2) describes 
coupling among modes whose compound mode numbers differ by 
$\Delta M = 0$. In the crude model indicated by the approximation (3),
modes with the same value of M have the same propagation constant 
and the same group velocity. This is not strictly true, since the de- 
generacy does not exist in the exact theory. However, the modes with 
the same M values are coupled very strongly, since coupling among
them is caused by the large amplitudes at low spatial frequencies seen 
in Fig. 5. It is thus reasonable to assume that mode mixing among 
modes with identical values of M is so rapid that these modes carry 
equal amounts of power. This assumption causes the differences of 
the power with equal M values to disappear so that only the first line 
on the right-hand side of (2) remains. This coupling process is implicitly 
taken care of by requiring that modes with equal values of M carry 
identical amounts of power. Using the compound mode number, we 
write 

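$$P_{\nu,m} = P_M, \qquad P_{\nu+2,m} = P_{M+2}, \qquad P_{\nu-2,m} = P_{M-2}, \tag{5}$$

$$h_{\nu+2,m;\,\nu,m} = h_{M+2,M}, \qquad h_{\nu,m;\,\nu-2,m} = h_{M,M-2}. \tag{6}$$

Equation (2) now assumes the form

$$\frac{\partial P_M}{\partial z} = h_{M+2,M}\,(P_{M+2} - P_M) - h_{M,M-2}\,(P_M - P_{M-2}). \tag{7}$$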


We proceed by adding all equations of type (7) with the same value of 
M.” However, in addition to this simple summation we use the fact 
that there are M modes with mode number M, but M + 2 modes with
number M + 2. By replacing the differences occurring in (7) by differentials,
we obtain

$$M\,\frac{\partial P_M}{\partial z} = 2(M + 2)\,h_{M+2,M}\,\frac{\partial P_{M+2}}{\partial M} - 2M\,h_{M,M-2}\,\frac{\partial P_M}{\partial M}. \tag{8}$$


The replacement of the difference terms with derivatives is an approxi- 
mation that is valid if there are very many guided modes, so that the 
M numbers are large and can be regarded as a quasi-continuum. With 
this approximation, (8) assumes the form 





$$M\,\frac{\partial P}{\partial z} = 4\,\frac{\partial}{\partial M}\left(M h_M \frac{\partial P}{\partial M}\right). \tag{9}$$


We can express the compound mode number in terms of the cone angle 
$\theta$, at which modes with the same values of M appear in the far-field
radiation pattern that escapes from the end of the fiber.*?? Using!’ 


$$\theta = \frac{\kappa}{k} \tag{10}$$

and

$$\frac{\partial \theta}{\partial M} = \frac{\pi}{2kb}, \tag{11}$$


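with k = free space propagation constant and b = core radius, we
obtain finally from (9) with $h_M = h = h(\theta)$

$$\frac{\partial P}{\partial z} = \left(\frac{\pi}{kb}\right)^2 \frac{1}{\theta}\,\frac{\partial}{\partial \theta}\left(\theta h \frac{\partial P}{\partial \theta}\right). \tag{12}$$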


The propagation of pulses is described by the time-dependent par- 
tial-differential equation 


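$$\frac{\partial P}{\partial z} + \frac{1}{v}\,\frac{\partial P}{\partial t} = \left(\frac{\pi}{kb}\right)^2 \frac{1}{\theta}\,\frac{\partial}{\partial \theta}\left(\theta h \frac{\partial P}{\partial \theta}\right), \tag{13}$$

where $v = v(\theta)$ is the group velocity of the mode labeled $\theta$. Solutions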
to this equation are expressed in the form 


$$P(z, \theta, t) = \sum_{j=1}^{N} \int_{-\infty}^{\infty} c_j(\omega)\,B_j(\omega, \theta)\,e^{-\rho^{(j)}(\omega) z}\,e^{i\omega t}\,d\omega. \tag{14}$$

The function $B_j(\omega, \theta)$ and the parameter $\rho^{(j)}(\omega)$ are obtained as solutions
of an eigenvalue problem.* The eigenvalue $\rho^{(j)}$ is expanded into a
perturbation series

$$\rho^{(j)}(\omega) = \rho_0^{(j)} + i\omega\,\rho_1^{(j)} + \omega^2 \rho_2^{(j)} + \cdots. \tag{15}$$

The second-order perturbation of the first eigenvalue, $\rho_2^{(1)}$, determines
the width of the equilibrium pulse via the formula’

$$T = 4\sqrt{\rho_2^{(1)} L}, \tag{16}$$


where T is the full width of the gaussian-shaped impulse response of
the multimode fiber measured between the 1/e points and L is the
length of the fiber. It was assumed that the coupling coefficient h is
independent of $\theta$. This assumption corresponds to a spatial Fourier
power spectrum of the distortion function f(z) of the form

$$\langle |F(\Omega)|^2 \rangle = \frac{C}{\Omega^4}. \tag{17}$$


Numerical differences between our present case and the example treated 
in Ref. 3 arise from the fact that the mode spacing (in $\beta$ space) of
nearest neighbors coupled by elliptical core deformations is twice as 
large as the spacing between nearest neighbors coupled by random 
bends. Taking these differences into account, we arrive at the following 
formula for the width of the equilibrium pulse: 


$$T = \frac{1.26\,n_2\,[(n_1/n_2) - 1]^{3/2}}{c}\,\sqrt{\frac{L}{C}}, \tag{18}$$

in which $n_1$ and $n_2$ are the refractive indices of core and cladding, c
is the velocity of light in vacuum, C is the constant defined in (17),
and L is the length of the fiber. 

For comparison with the length $\tau$ of a pulse carried by uncoupled
modes, as mentioned earlier,

$$\tau = \frac{n_2 \Delta L}{c}, \tag{19}$$

we use the “improvement factor,”

$$R = \frac{T}{\tau} = \frac{1.26\,\Delta^{1/2}}{\sqrt{CL}}, \tag{20}$$

with $\Delta = n_1/n_2 - 1$. The numerical values of R are physically meaningful
only if R < 1. If R > 1, the guide length L or the coupling
strength are too small for an equilibrium pulse to have established 
itself. The relative decrease of the pulse width achieved by mode 
coupling improves with increasing fiber length. 
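The length dependence of the improvement factor can be illustrated with a short Python sketch. It assumes the readings $T = 1.26\,n_2 \Delta^{3/2}\sqrt{L/C}/c$ for (18), $\tau = n_2 \Delta L/c$ for (19), and $R = 1.26\,\Delta^{1/2}/\sqrt{CL}$ for (20). The value of C used below is hypothetical and is chosen only to give numbers of a plausible size; the function names are likewise illustrative.

```python
import math

C_KM_PER_S = 299_792.458  # speed of light in vacuum, km/s


def equilibrium_width_ns(n1, n2, C_per_km, L_km):
    """Equilibrium pulse width T (ns), assuming the reading of Eq. (18) given above."""
    delta = n1 / n2 - 1.0
    return 1.26 * n2 * delta**1.5 * math.sqrt(L_km / C_per_km) / C_KM_PER_S * 1e9


def uncoupled_width_ns(n1, n2, L_km):
    """Uncoupled pulse width tau (ns), assuming tau = n2 * Delta * L / c."""
    delta = n1 / n2 - 1.0
    return n2 * delta * L_km / C_KM_PER_S * 1e9


def improvement_factor(n1, n2, C_per_km, L_km):
    """Improvement factor R = 1.26 * sqrt(Delta) / sqrt(C * L)."""
    delta = n1 / n2 - 1.0
    return 1.26 * math.sqrt(delta) / math.sqrt(C_per_km * L_km)


n2 = 1.4516            # cladding index implied by n1 - n2 = 0.0135 and Delta = 0.0093
n1 = n2 + 0.0135
C = 0.06               # hypothetical spectrum constant, 1/km
for L in (1.0, 4.0):
    print(L, improvement_factor(n1, n2, C, L), equilibrium_width_ns(n1, n2, C, L))
```

Quadrupling the fiber length halves R while doubling T, which is the square-root length dependence expected of an equilibrium pulse.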


IV. DISCUSSION OF EXPERIMENTAL RESULTS 


The step-index fiber used for this study exhibited unintentional, 
random fluctuations of the core-cladding boundary resulting in mode 
coupling. According to (19), an uncoupled pulse width of $\tau = 45$ ns/km
should have been observed for $\Delta = 9.3 \times 10^{-3}$; the actually observed
pulse width was $\tau = 10$ ns/km. The observed core-cladding irregularities
are able to account for a substantial part of the pulse shortening.

The function f(z) of (1) is shown in Fig. 4 for a 12.6-cm section of
the fiber. We consider the distortion of this fiber section as typical and 
representative of this particular fiber. The absolute square values of the 
Fourier components of the function shown in Fig. 4 are plotted in the 
bar graph of Fig. 5. It is clear that Fig. 5 does not represent the spatial 
power spectrum of the distortion function that enters the coupled- 
mode theory. The required power spectrum would have to be obtained 
by computing the Fourier spectrum of the function f(z) for the entire 
length of the fiber. An approximation of this function could be ob- 
tained by computing Fourier spectra for a large number of shorter 
fiber sections and averaging. This procedure is time-consuming if it 
must be done manually, and an automated process would be required 
to determine the spatial Fourier spectra of the core-cladding interface 
distortion of the fiber. 

In the absence of more information, we used the Fourier power spec- 
trum shown in Fig. 5 to extract information about the mode coupling 
process. Since a theory based on a fourth-power law [see (17) ] was 
already available, we approximated the data in Fig. 5 by fourth- 
power-law curves. The four curves plotted in the figure are possible 
approximations that may be roughly guessed from the bar graph. 
The numbers used to label the curves, T = 15, 20, 25, and 30 ns/km$^{1/2}$,
are the result of determining the constant C of (17) from the curves 
and using it to calculate the width of the equilibrium pulse according 
to (18). Which of the four curves is the most plausible approximation to 
the actual Fourier power spectrum is open to discussion. It appears to 
us that the curves between T = 20 and T = 25 ns/km$^{1/2}$ seem to approximate
the power spectrum reasonably well. The Fourier components
for high spatial frequencies tend to exceed the values of the
curves, while the curves are a little high for small spatial frequencies. 
Perhaps the fourth-order power law is not the best approximation to 
the Fourier power spectrum. On the other hand, we do not have enough 
information to obtain an accurate power spectrum. The high spatial 
frequencies shown in the bar graph result from rapid fluctuations of 
the curve f(z) that are partly noise of the measurement process. It can 
thus be expected that the amplitudes of the high-frequency compo- 
nents appear exaggerated. 

The most important point of this discussion is the observation that 
the measured core boundary distortions are indeed of the right order 
of magnitude to help explain the observed pulse shortening. Our data 
can easily account for an improvement of the pulse width by R = 0.5.
Since an improvement of R = 0.2 has been observed, it appears clear 
that the core boundary fluctuations are an important contributor to 
the observed pulse coupling behavior. The remaining amount of cou- 
pling may easily have been contributed by the bends of the fiber axis 
caused by the support mechanism and/or tension on the drum." 
Mode coupling with a fourth-power Fourier spectrum of (17) results 
in a loss penalty of!® 
$$R^2 \sigma_0^{(1)} L = 0.5 \text{ dB}. \tag{21}$$

$\sigma_0^{(1)}$ is the steady-state loss coefficient resulting from mode mixing.
For an R = 0.5, we thus expect an additional fiber loss of $\sigma_0^{(1)} L = 2$ dB.
The observed pulse width improvement of R = 0.2 results in a loss
penalty of $\sigma_0^{(1)} L = 10$ dB. The fiber losses actually observed are $\alpha = 30$
dB/km at a wavelength of 0.9 μm. The difference between these loss
values is attributable to absorption losses in the fiber material.
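The loss-penalty arithmetic of (21) is easy to check numerically. In the sketch below the function name is hypothetical, and the relation is taken as R² times the steady-state mode-mixing loss equal to 0.5 dB, as stated in the text.

```python
def mode_mixing_loss_db(R, penalty_db=0.5):
    """Steady-state mode-mixing loss (dB) implied by R**2 * loss = penalty_db."""
    return penalty_db / R**2


print(mode_mixing_loss_db(0.5))        # 2.0 dB, as quoted for R = 0.5
print(mode_mixing_loss_db(10.0 / 45))  # about 10 dB for the measured 10 ns against 45 ns
print(mode_mixing_loss_db(0.2))        # 12.5 dB if R is first rounded to 0.2
```

The 10-dB figure in the text corresponds to the unrounded ratio of the measured and uncoupled pulse widths; rounding R to 0.2 before squaring gives a somewhat larger value.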

The improvement factor R of (20) can be expressed in terms of a 


coupling length $L_c$,?

$$R = \left(\frac{L_c}{L}\right)^{1/2}.$$

For R = 0.5 we have $L_c = L/4$ or $L_c = 250$ m, since L = 1 km was
assumed.
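The relation between the improvement factor and the coupling length can be inverted in one line. The helper below is a hypothetical name and assumes the square-root form $R = (L_c/L)^{1/2}$.

```python
def coupling_length(R, L):
    """Invert R = sqrt(L_c / L) to obtain the coupling length L_c (same units as L)."""
    return R**2 * L


print(coupling_length(0.5, 1000.0))  # 250.0 (meters, for a 1-km fiber)
```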


V. CONCLUSIONS 


Mode coupling in multimode fibers may be caused by a number of 
fiber irregularities. Random index fluctuations, random bends, and 
core-cladding interface deformations are the most likely candidates. 
In this paper we have considered mode coupling by core-cladding inter- 
face deformations observed by a light-scattering technique that ex- 
tracts the necessary information from the backscattered light of a 
laser that impinges on the fiber at right angles to its axis. The mea- 
sured information was used to estimate the amount of pulse shortening 
that might be caused by this coupling mechanism. We found that the 
observed magnitude of the core-cladding boundary irregularities can 
explain some observed pulse shortening. The remainder may be caused 
by random bends introduced by the surface roughness of the drum on 
which the fiber is supported, or by refractive index changes along the 
fiber. 

It appears that the backscattering technique used for the determina- 
tion of the core-cladding interface irregularities may be a useful tool 
not only for monitoring the precision of fiber drawing processes but 
also for predicting the amount of mode coupling and consequently the 
pulse performance of multimode optical fibers. As shown here, to a 
good degree this information can be extracted from a fiber only 12.6
cm long. 
Quasi-Ray Analysis of Crosstalk Between
Multimode Optical Fibers 


By A. H. CHERIN and E. J. MURPHY 
(Manuscript received April 3, 1974) 


A model based on frustrated total reflection of waves in a multilayered 
medium has been developed to analyze the crosstalk between multimode 
optical fibers. Kappa ($\kappa$), the parameter indicating the power distribution
among the modes of the fiber, and fiber cladding thickness play very im- 
portant roles in determining the crosstalk isolation between fibers. Sig- 
nificant but less dominant effects on crosstalk are due to variations in fiber 
numerical aperture, length, and transmitting wavelength. 


I. INTRODUCTION


Crosstalk between communications circuits has long been a problem 
concerning engineers and designers in the telecommunications in- 
dustry. In a digital communications system using optical fibers as a 
transmission medium, at least a 30-dB signal-to-crosstalk ratio will 
be needed. In those applications where a lossy jacket around the fiber 
is undesirable, proper design of optical fiber cables and circuits requires 
an understanding of the parameters that control crosstalk between the 
fibers. 

A number of models can be found in the literature to describe crosstalk
between optical fibers.¹⁻⁷ Maxwell’s equations are the usual starting
point for these models, and a field theory approach describing the
coupling coefficients between individual modes results. Each of these 
models provides insight into a possible mechanism for describing cross- 
talk between optical fibers, and each is operationally useful for cal- 
culating crosstalk between single-mode fibers or guides with a small 
number of propagating modes. A very interesting model describing 
crosstalk resulting from scattering from a rough core-cladding interface
is also described in the literature.⁸

In this paper, a meridional quasi-ray tracing procedure is used to 
describe crosstalk between highly multimoded optical fibers. This 
approach is an extension of work developed by H. P. Yuen! and N.S. 
Kapany."-“ The mechanism for crosstalk coupling of energy between 
fibers is that of frustrated total internal reflection of waves in a multilayered
medium. Integral expressions for crosstalk and transmitted
power are developed in terms of the geometry of the system, the ab- 
sorption loss of the fiber cores, the transmission and reflection coeffi- 
cients at the core-cladding interface, and the energy distribution at the 
launching end of the fiber. It is assumed in this work that the propagat- 
ing modes within a fiber are uncoupled. The error resulting from this 
assumption should be small when the excitation is chosen to coincide 
with the equilibrium energy distribution and the coupling between the 
fibers is weak. A computer evaluation has been made of the integral 
expressions for crosstalk and transmitted power. Results of a study 
showing the functional relationship between far-end equal-level crosstalk
(FEXT) and cladding thickness, mode energy distribution, length,
numerical aperture, and wavelength are included in this paper, along 
with a discussion of future work in this area. 


II. DERIVATION OF GENERAL TRANSMISSION AND CROSSTALK FORMULAS


For the system of fibers shown in Fig. 1, we present a general deriva- 
tion of the transmission of energy within a fiber and the crosstalk be- 
havior between fibers. We assume that each fiber in an assembly is 
excited by a source that focuses its power on the center of the entrance 
end of a fiber, exciting meridional rays as shown in Fig. 2. The ray 
incident at an angle $\theta_i$ to the axis of the fiber is refracted into the fiber
at an angle $\theta$ which can be simply related to $\theta_i$ by the law of refraction.
Consider a distribution of input radiant intensity or input power per
unit solid angle $F_i(\theta, \phi)$ for some polar angle defined on the entrance
surface. A certain fraction of the power, $T_i(\theta, \phi)$, will be transmitted
into the fiber because of refraction at the entrance surface. Within the
incremental solid angle, $\sin\theta\,d\theta\,d\phi$, the power coupled into the fiber,
$dP(\theta, \phi)$, can be written as

$$dP(\theta, \phi) = F(\theta, \phi)\sin\theta\,d\theta\,d\phi = F_i(\theta, \phi)\,T_i(\theta, \phi)\sin\theta\,d\theta\,d\phi \tag{1}$$


or

$$\frac{dP}{d\Omega} = F(\theta, \phi).$$


We assume here that an absorption coefficient $\alpha$ per unit length is
defined for the fibers. The coefficient $\alpha$ takes into account both bulk
absorption loss and scattering loss. 

The power flux just prior to hitting the core-cladding interface of the 
fiber for the first time is given by 


$$\left(\frac{dP}{d\Omega}\right)_{I_1} = e^{-\alpha (d/2)\csc\theta}\,F(\theta, \phi), \tag{2}$$


where d is the fiber core diameter. 
Fig. 1—System of optical fibers.


The power reflected from the first reflection at the core-cladding 
interface can then be written as 


$$\left(\frac{dP}{d\Omega}\right)_{R_1} = e^{-\alpha (d/2)\csc\theta}\,R(\theta, \phi)\,F(\theta, \phi). \tag{3}$$


The power transmitted into a neighboring fiber from the first reflection 
at the interface is given by: 


$$\left(\frac{dP}{d\Omega}\right)_{T_1} = e^{-\alpha (d/2)\csc\theta}\,T(\theta, \phi)\,F(\theta, \phi). \tag{4}$$


Fig. 2—Meridional ray fiber excitation.

Fig. 3—Ray tracing of transmitted and crosstalk rays in adjacent optical fibers.


The next section shows how $R(\theta, \phi)$, the reflection coefficient, and
$T(\theta, \phi)$, the transmission coefficient, are calculated. $T(\theta, \phi)$ in this
study is, in effect, the mechanism for obtaining crosstalk between fibers.
Crosstalk is shown to be caused by frustrated total reflection of plane 
waves at the interface of a multilayered medium. The reflected rays 
continue their propagation down the fiber until they hit the wall again. 
A fraction of energy is then tunneled as crosstalk to the neighboring 
fibers, and another fraction is reflected. This process is repeated until 
the rays reach the exit end of the fiber, as illustrated in Fig. 3. Let us 
make the following definitions: 


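$\left(\frac{dP}{d\Omega}\right)_{I_n}$ = incident power density at the nth hit

$\left(\frac{dP}{d\Omega}\right)_{T_n}$ = transmitted power density at the nth hit

$\left(\frac{dP}{d\Omega}\right)_{R_n}$ = reflected power density at the nth hit.

The following power expressions for the nth hit clearly hold.

$$\left(\frac{dP}{d\Omega}\right)_{T_n} = \left(\frac{dP}{d\Omega}\right)_{I_n} T(\theta, \phi) \tag{5}$$

$$\left(\frac{dP}{d\Omega}\right)_{R_n} = \left(\frac{dP}{d\Omega}\right)_{I_n} R(\theta, \phi) \tag{6}$$

$$\left(\frac{dP}{d\Omega}\right)_{I_{n+1}} = \left(\frac{dP}{d\Omega}\right)_{R_n} e^{-\alpha d \csc\theta}. \tag{7}$$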
For a ray of angle $\theta$ relative to the fiber axis, the total number of
hits M is the largest integer smaller than $(L/d)\tan\theta + \tfrac{1}{2}$,

$$M = \left[\frac{L}{d}\tan\theta + \frac{1}{2}\right], \tag{8}$$


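and the total path length is $L \sec\theta$. The output power distribution at
the exit end is therefore

$$\left(\frac{dP}{d\Omega}\right)_{\text{output}} = R^M(\theta, \phi)\,e^{-\alpha L \sec\theta}\,F(\theta, \phi), \tag{9}$$

so that the total output power of the fiber is

$$P_{\text{output}} = \int_0^{2\pi}\!\int_0^{\theta_{\max}} \sin\theta\,F(\theta, \phi)\,R^M(\theta, \phi)\,e^{-\alpha L \sec\theta}\,T_e(\theta, \phi)\,d\theta\,d\phi. \tag{10}$$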
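The ray bookkeeping behind the output-power expression can be sketched in a few lines of Python. The sketch assumes a meridional ray launched on the fiber axis and a constant reflection coefficient R, which is a hypothetical simplification since in the paper R depends on the ray angles. All function names and numerical values are illustrative.

```python
import math


def number_of_hits(L, d, theta):
    """Wall hits for a meridional ray launched on axis: floor(L*tan(theta)/d + 1/2)."""
    return math.floor(L / d * math.tan(theta) + 0.5)


def ray_output_fraction(L, d, theta, alpha, R):
    """Fraction of the launched ray power reaching the exit end.

    Combines M reflections (constant R, an assumption) with bulk loss
    over the total path length L*sec(theta).
    """
    M = number_of_hits(L, d, theta)
    return R**M * math.exp(-alpha * L / math.cos(theta))


# Hypothetical example: 1-m fiber, 50-um core, ray at 0.1 rad to the axis.
print(number_of_hits(1.0, 50e-6, 0.1))
print(ray_output_fraction(1.0, 50e-6, 0.1, alpha=0.0, R=1.0 - 1e-6))
```

Even a short length of fiber involves thousands of wall hits for a ray at moderate angle, so a reflection coefficient only slightly below unity has a cumulative effect.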


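If the exit end of the fiber is not matched to the surrounding medium,
there is a transmission factor, $T_e(\theta, \phi) \neq 1$. From eqs. (2), (5), (6),
and (7), we obtain

$$\left(\frac{dP}{d\Omega}\right)_{T_n} = T(\theta, \phi)\,[R(\theta, \phi)]^{n-1}\,e^{-\alpha (n - 1/2) d \csc\theta}\,F(\theta, \phi). \tag{11}$$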
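Let each crosstalk ray at the nth hit suffer a further attenuation
$A'(\theta, \phi, L, n)$ before it reaches the exit end. The total crosstalk output
power is then given by

$$P_{XT} = \int_0^{2\pi}\!\int_0^{\theta_{\max}} \sin\theta\,F(\theta, \phi)\,T(\theta, \phi) \sum_{n=1}^{M} [R(\theta, \phi)]^{n-1} A'(\theta, \phi, L, n)\,e^{-\alpha (n - 1/2) d \csc\theta}\,d\theta\,d\phi. \tag{12}$$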


Note that M is a function of θ. Equations (10) and (12) provide the 
general formula for signal and crosstalk output power at the end of the 
fiber as a function of the system parameters. For a specific model of 
the crosstalk transfer, it is necessary to provide explicit expressions for 
F(θ, φ), R(θ, φ), T(θ, φ), and A′(θ, φ, L, n). A very simple model is 
discussed later in this paper. 
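The per-hit bookkeeping of eqs. (5) to (8) can be sketched in a few lines. This is only an illustration: the function names `num_hits` and `trace_ray` are hypothetical, R and T are treated as constants for a single ray, and the further attenuation A′ is left out.

```python
import math

def num_hits(length, core_diameter, theta):
    """Eq. (8): number of wall hits for a ray at angle theta to the fiber axis."""
    return math.floor((length / core_diameter) * math.tan(theta) + 0.5)

def trace_ray(f_in, refl, trans, alpha, core_diameter, theta, hits):
    """Apply eqs. (5)-(7) hit by hit for one ray.

    Returns the power density arriving at hit number hits + 1 and the list of
    power densities tunneled out at hits 1..hits (before any attenuation A').
    """
    step = math.exp(-alpha * core_diameter / math.sin(theta))  # loss between hits, eq. (7)
    incident = f_in
    leaked = []
    for _ in range(hits):
        leaked.append(incident * trans)     # eq. (5): transmitted part
        incident = incident * refl * step   # eqs. (6) and (7): reflected, then attenuated
    return incident, leaked
```

Each entry `leaked[n - 1]` reproduces the closed form of eq. (11), which is what the crosstalk sum in eq. (12) accumulates.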

The quantity of particular interest in system design is the signal-to-crosstalk 
power ratio defined by 

$$\left(\frac{S}{N}\right)_{XT} = \frac{P_{\text{output}}}{P_{XT}}. \tag{13}$$

Far-end equal-level crosstalk in decibels is then defined as 

$$\text{FEXT} = 10\log_{10}\left\{\frac{P_{\text{output}}}{P_{XT}}\right\}. \tag{14}$$

III. FRUSTRATED TOTAL INTERNAL REFLECTION IN A MULTILAYERED 
MEDIUM—A CROSSTALK MECHANISM 

In this section, a discussion is given of frustrated total reflection of 
plane waves in a multilayer medium. This mechanism is the one re- 
sponsible for crosstalk in our model and serves as the basis for our 
crosstalk power transmission calculation. 

Consider the usual representation of total internal reflection shown 
in Fig. 4. The ray is completely reflected when ψ exceeds the critical 
angle, ψ_c = sin⁻¹(n₂/n₁). The field amplitude decays exponentially 
in the optically rarer medium. This picture is quite different when we 
look at the energy flow of the waves more closely.u-" The Poynting 
vector in the rarer medium is by no means zero; only its time average 
vanishes. This energy flow is depicted in Fig. 5, which also shows that 
a displacement Δ exists, because of the Goos-Hänchen shift,¹⁵,¹⁶ between 
the incident and reflected beam. For plane waves, this shift is 
readily calculated.¹⁵ It increases with decreasing angle of incidence ψ, 
as expected. The Poynting vector also makes a deeper penetration 
when ψ is smaller. 

With this picture we can see that, if a third medium of refractive 
index n₁ is brought into close proximity to the first surface from below, 
as shown in Fig. 6, some energy will be trapped by it as the incident 
rays make their penetration. This trapped energy will then propagate 
into the third medium. This phenomenon of energy transmission 
through a lower refractive index slab for incidence greater than the 
critical angle is called frustrated total internal reflection. It is a wave 
phenomenon outside the domain of pure geometrical optics and is 
exactly analogous to the tunneling of quantum mechanical particles 
through a potential barrier, which is classically forbidden. 

Fig. 4—Total internal reflection from plane dielectric boundary. 

22 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1975 

Fig. 5—Displacement of ray—the Goos-Hänchen shift. 

Consider now three homogeneous media that might represent the 
core, claddings, and core of adjacent fibers and model them by two half 
spaces of refractive index n₁ separated by a third medium of index n₂ and 
thickness t. We have three homogeneous media for which the transmission 
coefficient can be obtained by matching boundary conditions 
directly. The result, for plane wave incidence, is well known in the 
literature¹⁷ and has been derived in the appendix for the N media case 
and specialized to the three-media case discussed here. We assume that, 
in the vicinity of the core-cladding interface, the signal can be represented 
as a plane wave and quote the result for the lossless case for the 
geometry and notation shown in Fig. 7. 

Fig. 6—Frustrated total internal reflection in a multilayered dielectric medium. 


Let 

$$\gamma = \left[\left(\frac{n_1}{n_2}\right)^2 \sin^2\theta_1 - 1\right]^{1/2}. \tag{15}$$

The transmission coefficient, T(θ, φ), which can be defined as the 
ratio of the transmitted power flux to the incident power flux, is 
given by 

$$T(\theta,\phi) = A_1 \frac{1}{\cosh^2\beta + \left[\dfrac{n_2^2\gamma^2 - n_1^2\cos^2\theta_1}{2 n_1 n_2 \gamma\cos\theta_1}\right]^2 \sinh^2\beta} + A_2 \frac{1}{\cosh^2\beta + \left[\dfrac{n_2^2\cos^2\theta_1 - \gamma^2 n_1^2}{2 n_1 n_2 \gamma\cos\theta_1}\right]^2 \sinh^2\beta}, \tag{16}$$

where 

A₁ = the fraction of incident power polarized perpendicular to the 
plane of incidence. 

A₂ = the fraction of incident power polarized parallel to the plane 
of incidence. 

$$\beta = \frac{2\pi}{\lambda}\, n_2 \gamma t. \tag{17}$$

Formula (16) is written in terms of θ₁, the incident angle at the input 
of the fiber, and is used in the calculation of crosstalk power later in this 
paper. 
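As a minimal sketch, eqs. (15) to (17) translate directly into a short function. The names `gamma` and `transmission` and the default equal split A₁ = A₂ = 1/2 are assumptions made for illustration only; t and the wavelength must be in the same length unit.

```python
import math

def gamma(n1, n2, theta1):
    """Eq. (15); real only beyond the critical angle, sin(theta1) > n2/n1."""
    return math.sqrt((n1 / n2) ** 2 * math.sin(theta1) ** 2 - 1.0)

def transmission(n1, n2, theta1, t, wavelength, a_perp=0.5, a_par=0.5):
    """Eq. (16): power transmission through a slab of index n2 and thickness t."""
    g = gamma(n1, n2, theta1)
    beta = 2.0 * math.pi / wavelength * n2 * g * t   # eq. (17)
    c1 = math.cos(theta1)
    k1 = 2.0 * n1 * n2 * g * c1
    q_perp = (n2 ** 2 * g ** 2 - n1 ** 2 * c1 ** 2) / k1
    q_par = (n2 ** 2 * c1 ** 2 - g ** 2 * n1 ** 2) / k1
    ch2 = math.cosh(beta) ** 2
    sh2 = math.sinh(beta) ** 2
    return a_perp / (ch2 + q_perp ** 2 * sh2) + a_par / (ch2 + q_par ** 2 * sh2)
```

For t = 0 the slab vanishes and the function returns A₁ + A₂; for growing t the hyperbolic terms make the tunneled fraction fall off roughly exponentially.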





Fig. 7—Geometry for calculation of transmission and reflection coefficients in a 
single-layered three-medium dielectric. 
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IV. CROSSTALK FORMULA ADAPTED TO CIRCULAR GEOMETRY 


We assume now that the only crosstalk modeled here is that 
due to frustrated total reflection. The geometry of the rays 
near the fiber wall at a hit is illustrated in Fig. 8, which is a cross-sectional 
view of a fiber and one of its neighbors. Here, φ is the polar 
angle, which also serves to define the length t. Using simple geometry 
we see that 

$$t = c\cos\phi - d/2 - \left[(d/2)^2 - c^2\sin^2\phi\right]^{1/2} \tag{18}$$

and that the angle φ_m for which the line ON is tangent to the circle 
O′ is 

$$\phi_m = \sin^{-1}(d/2c). \tag{19}$$

Let E denote the center point of the entrance end of the fiber. Then a 
meridional ray ES hitting the wall of the fiber lies in the plane of incidence 
EOS, which cuts the fiber O′ at a line passing through S′ parallel 
to the axis of the fiber. The transmission coefficient is calculated in 
the form of plane wave tunneling through a slab of thickness t, as in 
Fig. 7. When the medium between is homogeneous and lossless, eq. (16) 
can be used. 

Fig. 8—Geometry of two adjacent round optical fibers. 

The transmitted ray will propagate as a skew ray and emerge from 
a point a number of wavelengths from S′ depending on θ, as is clear 
from Figs. 5 and 7. Since the displacement Δ is small compared to the 
path length between bounces and very small compared to the length 
of fiber we are interested in, we can essentially regard the ray as emerging 
from S′ for the purpose of calculating the path length of the transmitted 
ray. 

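Skew ray analysis shows” that the skew-ray crosstalk 
path lengths are also given by L sec θ. We can now calculate the 
expressions for P_XT and P_output. From the above discussion, substitution 
into the crosstalk integral, eq. (12), yields 

$$A'(n)\, e^{-\alpha(n-1)d\csc\theta} = e^{-\alpha L\sec\theta} \tag{20}$$

and 

$$\sum_{n=1}^{M} [R(\theta,\phi)]^{n-1} A'(n)\, e^{-\alpha(n-1)d\csc\theta} = e^{-\alpha L\sec\theta} \sum_{n=1}^{M} [R(\theta,\phi)]^{n-1} = e^{-\alpha L\sec\theta}\left[\frac{1 - R(\theta,\phi)^{M}}{1 - R(\theta,\phi)}\right]. \tag{21}$$

The crosstalk integral simplifies to 

$$P_{XT} = \int_0^{\theta_{\max}} d\theta \int_0^{2\pi} d\phi\, \sin\theta\, F(\theta,\phi)\, e^{-\alpha L\sec\theta}\, T(\theta,\phi)\left[\frac{1 - R(\theta,\phi)^{M}}{1 - R(\theta,\phi)}\right]. \tag{22}$$

Making a change of variable to t with 

$$\frac{d\phi}{dt} = \frac{c^2 - (d/2)^2 - (t + d/2)^2}{(t + d/2)\left\{(c^2 - t^2)\left[(t + d)^2 - c^2\right]\right\}^{1/2}}, \tag{23}$$

we can then write 

$$P_{XT} = N \int_0^{\theta_{\max}} d\theta \int_{t_1}^{t_2} dt \left(\frac{d\phi}{dt}\right) \sin\theta\, F(\theta,t)\, e^{-\alpha L\sec\theta}\, T(\theta,t)\left[\frac{1 - R(\theta,t)^{M}}{1 - R(\theta,t)}\right], \tag{24}$$

where N = 2 times the number of fibers adjacent to the excited fiber. 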
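The change of variable can be checked numerically. The sketch below codes eqs. (18), (19), and (23) and compares the closed-form derivative with a central finite difference of t(φ); the function names are hypothetical, and c and d only need to share a length unit.

```python
import math

def gap(phi, c, d):
    """Eq. (18): tunneling distance t at polar angle phi (requires phi < phi_max)."""
    return c * math.cos(phi) - d / 2 - math.sqrt((d / 2) ** 2 - (c * math.sin(phi)) ** 2)

def phi_max(c, d):
    """Eq. (19): angle at which the line ON is tangent to the neighboring core."""
    return math.asin(d / (2 * c))

def dphi_dt(t, c, d):
    """Eq. (23); singular at t = c - d, i.e. at phi = 0."""
    u = t + d / 2
    root = math.sqrt((c ** 2 - t ** 2) * ((t + d) ** 2 - c ** 2))
    return (c ** 2 - (d / 2) ** 2 - u ** 2) / (u * root)

# Example: d = 50.8 and c = 85 (micrometers), evaluated at phi = 0.15 rad.
c, d, phi, h = 85.0, 50.8, 0.15, 1e-5
numeric = 2 * h / (gap(phi + h, c, d) - gap(phi - h, c, d))
analytic = dphi_dt(gap(phi, c, d), c, d)
```

The singularity of dφ/dt at the lower limit of eq. (24) is integrable, but a numerical evaluation may find it simpler to integrate over φ as in eq. (22) and evaluate t(φ) from eq. (18).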
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V. SIMPLIFIED CROSSTALK FORMULAS USED FOR INITIAL 

COMPUTER STUDY 

A specific model requires a knowledge of F(θ, φ), R(θ, φ), and T(θ, φ) 
in eqs. (24) and (10) to calculate the crosstalk between fibers. As a 
first-order approximation, the following assumptions were made: 

(i) The interfiber medium is lossless and homogeneous so that eq. 
(16) is valid as well as: 

(ii) $$T(\theta,\phi) + R(\theta,\phi) = 1. \tag{25}$$

(iii) The input angular power distribution of the fiber is a gaussian 
function of the form 

$$F(\theta,\phi) = e^{-(\theta/\kappa\theta_c)^2} \tag{26}$$

where κ is a parameter that is a measure of the width of the 
beam and also an indication of how the power is distributed 
between the modes of the fiber. θ_c is the critical angle of the 
fiber. 

(iv) The air spaces between the fibers are replaced by cladding 
material for the purposes of calculating the transmission 
coefficient, T(θ, φ). A partial check of this assumption showed 
that the primary contribution to crosstalk occurred for small 
values of φ. Replacing the air spaces by cladding material 
caused a maximum error in crosstalk of 4.5 dB. 


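Under the assumptions mentioned above, the crosstalk integral, 
(24), becomes 

$$P_{XT} = N \int_0^{\theta_{\max}} d\theta \int_{t_1}^{t_2} dt \left(\frac{d\phi}{dt}\right) \sin\theta\, e^{-(\theta/\kappa\theta_c)^2} e^{-\alpha L\sec\theta}\left\{1 - [1 - T(\theta,t)]^{M}\right\}. \tag{27}$$

The total output power of the transmitting fiber becomes 

$$P_{\text{output}} = \int_0^{2\pi} d\phi \int_0^{\theta_{\max}} \sin\theta\, e^{-(\theta/\kappa\theta_c)^2} e^{-\alpha L\sec\theta}\, [1 - T(\theta,\phi)]^{M}\, d\theta. \tag{28}$$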
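A rough numerical sketch of eqs. (27) and (28) follows. It is not the program used in the study. The choices below are assumptions made for illustration: midpoint-rule quadrature, integration over φ with t(φ) taken from eq. (18) instead of over t, θ_max set equal to θ_c, equal polarization weights A₁ = A₂ = 1/2, T = 0 outside the angular windows facing a neighbor, and six adjacent fibers (N = 12). All lengths are in micrometers.

```python
import math

def transmission(n1, n2, theta, t, lam):
    """Eq. (16) with A1 = A2 = 1/2 (assumed); theta is measured from the fiber axis."""
    c1 = math.sin(theta)                                  # cos(theta1), theta1 = pi/2 - theta
    g2 = (n1 / n2) ** 2 * math.cos(theta) ** 2 - 1.0      # gamma squared, eq. (15)
    g = math.sqrt(g2)
    beta = 2 * math.pi / lam * n2 * g * t                 # eq. (17)
    k1 = 2 * n1 * n2 * g * c1
    ch2, sh2 = math.cosh(beta) ** 2, math.sinh(beta) ** 2
    q_perp = (n2 ** 2 * g2 - n1 ** 2 * c1 ** 2) / k1
    q_par = (n2 ** 2 * c1 ** 2 - n1 ** 2 * g2) / k1
    return 0.5 / (ch2 + q_perp ** 2 * sh2) + 0.5 / (ch2 + q_par ** 2 * sh2)

def fext_db(na, d, c, length, lam, alpha_db_per_km, kappa,
            n2=1.458, neighbors=6, n_theta=200, n_phi=40):
    """FEXT of eq. (14) from midpoint-rule estimates of eqs. (27) and (28)."""
    n1 = math.sqrt(n2 ** 2 + na ** 2)
    theta_c = math.acos(n2 / n1)
    alpha = alpha_db_per_km * math.log(10) / 10 / 1e9     # power loss per micrometer
    phi_m = math.asin(d / (2 * c))                        # eq. (19)
    n_sym = 2 * neighbors                                 # N in eq. (24)
    dth, dph = theta_c / n_theta, phi_m / n_phi
    p_xt = p_out = 0.0
    for i in range(n_theta):
        th = (i + 0.5) * dth
        m = math.floor(length / d * math.tan(th) + 0.5)   # eq. (8)
        w = (math.sin(th) * math.exp(-(th / (kappa * theta_c)) ** 2)
             * math.exp(-alpha * length / math.cos(th)) * dth)
        leak = 0.0                                        # integral of 1 - (1 - T)^M over one half-window
        for j in range(n_phi):
            ph = (j + 0.5) * dph
            t = c * math.cos(ph) - d / 2 - math.sqrt((d / 2) ** 2 - (c * math.sin(ph)) ** 2)
            tr = transmission(n1, n2, th, t, lam)
            leak += -math.expm1(m * math.log1p(-tr)) * dph  # stable 1 - (1 - T)^M
        p_xt += w * n_sym * leak
        p_out += w * (2 * math.pi - n_sym * leak)
    return 10 * math.log10(p_out / p_xt)
```

Writing 1 − (1 − T)^M through `log1p` and `expm1` matters here because T can be many orders of magnitude below machine precision while M is of order 10⁶. The coarse grid resolves the narrow band of angles near θ_c only approximately, so the numbers this sketch produces should not be compared directly with the figures.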


VI. COMPUTER STUDY—SUMMARY OF RESULTS 


A computer program was written and the integrals (27) and (28) 
were evaluated for typical fiber parameters. A number of studies were 
made to determine how FEXT [eq. (14)] varies as a function of cladding 
thickness, numerical aperture, length, κ (kappa), and wavelength. 
Table I is a guide that defines the parameters and relates the variables 
in the study to the figure numbers appearing in this paper. Figures 9 
to 11 show, for numerical apertures of 0.10, 0.15, and 0.20, respectively, 
the relationships between FEXT and cladding thickness for a 1-mil fiber 
core diameter. For the same numerical apertures, Figs. 12 to 14 and 
15 to 17 show this relationship for 2- and 3-mil core diameters. 

Table I 

Far-end crosstalk vs.   Figure   Parameters held constant                         Range of independent variable 
d/c (core diameter/     9        NA = 0.10, d = 25.4 µm                           d/c = 0.3 to 0.7 
cladding diameter)      10       NA = 0.15, d = 25.4 µm                           d/c = 0.3 to 0.7 
                        11       NA = 0.20, d = 25.4 µm                           d/c = 0.3 to 0.7 
                        12       NA = 0.10, d = 50.8 µm                           d/c = 0.5 to 0.9 
                        13       NA = 0.15, d = 50.8 µm                           d/c = 0.5 to 0.9 
                        14       NA = 0.20, d = 50.8 µm                           d/c = 0.5 to 0.9 
                        15       NA = 0.10, d = 76.2 µm                           d/c = 0.5 to 0.9 
                        16       NA = 0.15, d = 76.2 µm                           d/c = 0.5 to 0.9 
                        17       NA = 0.20, d = 76.2 µm                           d/c = 0.5 to 0.9 
Numerical aperture      18       d = 50.8 µm, c = 85 µm, α = 20 dB/km,            NA = 0.05 to 0.30 
                                 L = 1 km, λ = 0.6328 µm, 
                                 κ = 0.25, 0.35, 0.40, 0.50, 10.0 
Fiber length            19       d = 50.8 µm, c = 85 µm, α = 20 dB/km,            L = 100 m to 5 km 
                                 NA = 0.15, λ = 0.6328 µm, 
                                 κ = 0.25, 0.35, 0.4, 0.5, 10.0 
κ (kappa)               20       L = 1 km, d = 50.8 µm, c = 85 µm,                κ = 0.1 to 1.0 
                                 α = 20 dB/km, NA = 0.15, λ = 0.6328 µm 
Wavelength              21       L = 1 km, d = 50.8 µm, c = 85 µm,                λ = 0.6328 to 1.06 µm 
                                 α = 20 dB/km, NA = 0.15, 
                                 κ = 0.25, 0.35, 0.4, 0.5, 10.0 

For Figs. 9 to 17, the following are also held constant: L = 1 km, 
λ = 0.6328 µm, α = 20 dB/km, κ = 0.25, 0.35, 0.4, 0.5, 10.0. 

Fig. 9—FEXT vs. d/C (cladding thickness), d = 25.4 µm, NA = 0.10, L = 1 km, 
λ = 0.6328 µm, α = 20 dB/km. 

Fig. 10—FEXT vs. d/C (cladding thickness), d = 25.4 µm, NA = 0.15, L = 1 km, 
λ = 0.6328 µm, α = 20 dB/km. 

Fig. 11—FEXT vs. d/C (cladding thickness), d = 25.4 µm, NA = 0.20, L = 1 km, 
λ = 0.6328 µm, α = 20 dB/km. 

Fig. 12—FEXT vs. d/C (cladding thickness), d = 50.8 µm, NA = 0.10, L = 1 km, 
λ = 0.6328 µm, α = 20 dB/km. 

Fig. 13—FEXT vs. d/C (cladding thickness), d = 50.8 µm, NA = 0.15, L = 1 km, 
λ = 0.6328 µm, α = 20 dB/km. 

Fig. 14—FEXT vs. d/C (cladding thickness), d = 50.8 µm, NA = 0.20, L = 1 km, 
λ = 0.6328 µm, α = 20 dB/km. 

Fig. 15—FEXT vs. d/C (cladding thickness), d = 76.2 µm, NA = 0.10, L = 1 km, 
λ = 0.6328 µm, α = 20 dB/km. 

Fig. 16—FEXT vs. d/C (cladding thickness), d = 76.2 µm, NA = 0.15, L = 1 km, 
λ = 0.6328 µm, α = 20 dB/km. 

Fig. 17—FEXT vs. d/C (cladding thickness), d = 76.2 µm, NA = 0.20, L = 1 km, 
λ = 0.6328 µm, α = 20 dB/km. 
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For practical claddings greater than 12 microns in thickness, in- 
creasing the cladding thickness will improve crosstalk isolation by 
approximately 0.8 dB/micron. Cladding thickness is an important 
design parameter for improving crosstalk isolation between optical 
fibers. If we attempted to eliminate crosstalk by coating the cladding 
with an opaque substance, cladding thickness would also play an 
important role in determining the amount of loss suffered by the 
transmitted energy in the core owing to the lossy coating. A cross-check of 
Figs. 9 to 17 shows a very weak dependence of FEXT on core diameter. 

Fig. 18—FEXT vs. numerical aperture, d = 50.8 µm, C = 85 µm, α = 20 dB/km, 
L = 1 km, λ = 0.6328 µm, κ = 0.25, 0.35, 0.4, 0.5, 10.0. 

A fiber core diameter of 50.8 µm and cladding diameter of 85 µm was 
chosen to represent a typical fiber geometry in the remainder of this 
study. Figure 18 shows the relationship between FEXT and numerical 
aperture with a fixed cladding refractive index of 1.458. For a given 
kappa, crosstalk isolation improves by approximately 6 dB by doubling 
the numerical aperture in the range from 0.10 to 0.30. For an NA 
= 0.15, crosstalk was evaluated as a function of length as shown in 
Fig. 19. For a fixed kappa, crosstalk isolation decreased by approximately 
2 dB per decade of length. The effect of kappa on crosstalk is 
shown in Fig. 20. Kappa is an indication of the distribution of power 
among the modes of the fiber. As kappa decreases in value, the energy 
distributed in the lower-order modes of a fiber increases. A knowledge 
of the steady-state mode distribution should enable us to estimate an 
effective κ for use in this model. As illustrated in Fig. 20, κ is an 
extremely important parameter in determining the crosstalk between 
fibers. For kappas less than 0.5, crosstalk isolation is greater than 40 
dB for a kilometer length of fiber. For kappas less than 0.40, crosstalk 
isolation is greater than 50 dB. The final study, illustrated in Fig. 21, 
shows the relationship between FEXT and wavelength. For a given 
kappa, doubling the wavelength of the transmitting signal decreases 
crosstalk isolation by approximately 7 dB. It is envisioned that typical 
fiber losses in a future optical transmission system will be less than 
the 20-dB/km loss reported here. In the future we will extend our 
computer study to include fiber losses ranging from 2 to 20 dB/km, 
but do not expect this loss parameter to change the general conclusions 
drawn in this paper. 

To determine if the primary mechanism for crosstalk in optical 
fibers is frustrated total reflection of waves in a multilayered medium, 
a crosstalk experiment on a long length of parallel fibers must be 
performed. In this experiment, crosstalk and kappa should be measured 
as a function of length at a number of different wavelengths. This type 
of experiment will enable us to check many aspects of the model. 

Fig. 19—FEXT vs. fiber length, d = 50.8 µm, C = 85 µm, α = 20 dB/km, NA = 0.15, 
λ = 0.6328 µm, κ = 0.25, 0.35, 0.40, 0.50, 10.0. 

Fig. 20—FEXT vs. κ (kappa), L = 1 km, d = 50.8 µm, C = 85 µm, α = 20 dB/km, 
NA = 0.15, λ = 0.6328 µm. 

Fig. 21—FEXT vs. λ (wavelength), L = 1 km, d = 50.8 µm, C = 85 µm, α = 20 
dB/km, NA = 0.15, κ = 0.25, 0.35, 0.40, 0.50, 10.0. 


APPENDIX A 
Calculation of Transmission Coefficients in a Multilayered Medium 


The general expression for the transmission coefficients in a multi- 
layered dielectric medium is well known in the literature78 and is 
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presented here. This expression is simplified for the three-medium case 
and, ultimately, eq. (16) in the text is derived. 

Consider the geometry shown in Fig. 22. Let us suppose that between 
two semi-infinite media, denoted by 1 and n + 1, there are n − 1 
layers of dielectric material denoted by 2, 3, ..., n. Let a plane wave 
be incident on the last layer at an angle of incidence θ_{n+1} and let the 
plane of incidence be the x-z plane. As a result of multiple reflections 
at the boundaries of the layers, two waves exist in each medium 
with the exception of medium 1. Our problem will be to determine the 
amplitude of the transmitted wave in medium 7 + 1 and hence the 
transmission coefficient. The following notation will be used. 


z_j = the coordinate of the boundary between the jth and (j + 1)st 
layers. 

d_j = z_j − z_{j−1} = the thickness of the jth layer. 

k_j = (2π/λ)(n_j) = the wave number in the jth medium. 

α_j = k_j cos θ_j = z component of the wave vector in the jth layer. 

φ_j = α_j d_j = the phase change in the jth medium. 

Z_j = the self-impedance of the jth layer. 

Z_in^(j) = the input impedance looking into the jth medium from the 
(j + 1)st medium. 


The electric and magnetic fields in the jth medium can then be 
written as 

$$E_{jy} = A_j \exp[-i\alpha_j(z - z_{j-1})] + B_j \exp[i\alpha_j(z - z_{j-1})] \tag{29}$$

$$H_{jx} = \frac{1}{Z_j}\left\{A_j \exp[-i\alpha_j(z - z_{j-1})] - B_j \exp[i\alpha_j(z - z_{j-1})]\right\}. \tag{30}$$





Fig. 22—Geometry used for calculation of transmission coefficients in multilayered 
dielectric media. 
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The x and t dependency in this case is omitted for the sake of brevity, 
but assumes the general form: 


$$\exp\, i(k_{n+1} x \sin\theta_{n+1} - \omega t).$$

A_j and B_j (B_1 = 0) are the amplitudes of the incident and reflected 
waves in the jth medium. The amplitude A_{n+1} of the incident wave is 
assumed to be known. To obtain the transmission coefficient of interest, 

$$\frac{A_1}{A_{n+1}} = \frac{A_1}{A_2}\,\frac{A_2}{A_3}\cdots\frac{A_n}{A_{n+1}}, \tag{31}$$

we can write 2n boundary equations for the tangential components of 
the fields and solve these equations for A_1, A_2, ..., A_{n+1}; B_1, B_2, ..., 
B_{n+1}. When these coefficients are known, the transmission and reflection 
coefficients for the multilayered medium are obtained. 

A second approach, and one that develops an iterative scheme more 
suitable to a digital computer, describes the transmission coefficient in 
terms of a generalized input impedance.” This is the approach that 
will be followed here. It is straightforward to derive, and it is shown in 
the literature that: 

$$\frac{A_j}{A_{j+1}} = \frac{Z_{in}^{(j)} + Z_j}{Z_{in}^{(j)} + Z_{j+1}} \exp(i\phi_j). \tag{32}$$

Substituting into eq. (31) yields 

$$\tau_{n+1} = \frac{A_1}{A_{n+1}} = \prod_{j=1}^{n} \frac{Z_{in}^{(j)} + Z_j}{Z_{in}^{(j)} + Z_{j+1}} \exp(i\phi_j) \tag{33}$$

with d_1 = 0, where 

$$Z_{in}^{(j)} = \frac{Z_{in}^{(j-1)} - iZ_j \tan\phi_j}{Z_j - iZ_{in}^{(j-1)} \tan\phi_j}\, Z_j. \tag{34}$$

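Specializing eq. (33) for the three-medium case discussed in the text, 
we can obtain, with some algebraic manipulation, the following 
formula: 

$$\tau_3 = \frac{4Z_1Z_2}{(Z_1 - Z_2)(Z_2 - Z_3)e^{i\phi_2} + (Z_1 + Z_2)(Z_2 + Z_3)e^{-i\phi_2}}. \tag{35}$$

For electric fields parallel and perpendicular to the plane of incidence, 
eq. (35) will take different forms. For E_⊥, 

$$Z_1 = \frac{Z_0}{n_1\cos\theta_1} = Z_3 \tag{36}$$

$$Z_2 = \frac{Z_0}{n_2\cos\theta_2} = \frac{Z_0}{i n_2 \gamma}, \tag{37}$$

where 

$$\gamma = \left[\left(\frac{n_1}{n_2}\right)^2 \sin^2\theta_1 - 1\right]^{1/2} \tag{38}$$

$$Z_0 = \sqrt{\frac{\mu_0}{\epsilon_0}} \tag{39}$$

$$\beta = k_2 d_2 \gamma. \tag{40}$$

After some algebraic manipulation, 

$$\tau_{3\perp} = \frac{K_1^2\cosh\beta - iK_1K_2\sinh\beta}{K_1^2\cosh^2\beta + K_2^2\sinh^2\beta}, \tag{41}$$

where 

$$K_1 = 2n_1n_2\gamma\cos\theta_1 \tag{42}$$

$$K_2 = n_2^2\gamma^2 - n_1^2\cos^2\theta_1; \tag{43}$$

for E_∥, 

$$Z_1 = \frac{Z_0\cos\theta_1}{n_1} = Z_3 \tag{44}$$

$$Z_2 = \frac{Z_0\cos\theta_2}{n_2} = \frac{iZ_0\gamma}{n_2}, \tag{45}$$

and τ_{3∥} becomes 

$$\tau_{3\parallel} = \frac{K_1^2\cosh\beta + iK_1K_3\sinh\beta}{K_1^2\cosh^2\beta + K_3^2\sinh^2\beta}, \tag{46}$$

where 

$$K_3 = n_2^2\cos^2\theta_1 - \gamma^2 n_1^2. \tag{47}$$

Utilizing eqs. (41) and (46), we can obtain the power transmission 
coefficients directly: 

$$T_{3\perp} = |\tau_{3\perp}|^2 = \frac{1}{\cosh^2\beta + \left[(n_2^2\gamma^2 - n_1^2\cos^2\theta_1)/2n_1n_2\gamma\cos\theta_1\right]^2\sinh^2\beta} \tag{48}$$

$$T_{3\parallel} = |\tau_{3\parallel}|^2 = \frac{1}{\cosh^2\beta + \left[(n_2^2\cos^2\theta_1 - \gamma^2n_1^2)/2n_1n_2\gamma\cos\theta_1\right]^2\sinh^2\beta}. \tag{49}$$

Equations (48) and (49) were used to obtain eq. (16) in the text of this 
paper. 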
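The iterative scheme of eqs. (32) to (34) is short enough to sketch directly. The function names and the list-based argument layout are hypothetical choices made for illustration. The evanescent layer is represented with cos θ₂ = iγ, so its phase thickness is φ₂ = iβ, and Z₀ is set to 1 because it cancels in the ratio.

```python
import cmath
import math

def transmission_amplitude(z, phi):
    """Eqs. (33) and (34) for media 1..n+1.

    z[j-1] is the impedance Z_j of medium j; phi[j-1] is the phase thickness
    phi_j. phi[0] must be 0 (d_1 = 0) and the last entry of phi is unused.
    Returns the amplitude ratio A_1 / A_{n+1}.
    """
    z_in = z[0]                       # looking into the semi-infinite medium 1
    tau = 1.0 + 0.0j
    for j in range(len(z) - 1):       # list index j corresponds to medium j + 1
        if j > 0:
            tan_p = cmath.tan(phi[j])
            z_in = z[j] * (z_in - 1j * z[j] * tan_p) / (z[j] - 1j * z_in * tan_p)  # eq. (34)
        tau *= (z_in + z[j]) / (z_in + z[j + 1]) * cmath.exp(1j * phi[j])          # eq. (32)
    return tau

def three_media_closed_form(z1, z2, z3, phi2):
    """Eq. (35)."""
    return 4 * z1 * z2 / ((z1 - z2) * (z2 - z3) * cmath.exp(1j * phi2)
                          + (z1 + z2) * (z2 + z3) * cmath.exp(-1j * phi2))

# Perpendicular polarization, eqs. (36)-(40), with illustrative values.
n1, n2, theta1, t, lam = 1.47, 1.458, 1.5, 2.0, 0.6328
g = math.sqrt((n1 / n2) ** 2 * math.sin(theta1) ** 2 - 1.0)
beta = 2 * math.pi / lam * n2 * g * t
z1 = 1.0 / (n1 * math.cos(theta1))
z2 = 1.0 / (n2 * 1j * g)
tau = transmission_amplitude([z1, z2, z1], [0.0, 1j * beta, 0.0])
```

For three media the recursion agrees with eq. (35), and the squared magnitude of `tau` agrees with eq. (48); the same function accepts longer impedance lists for the N-media case.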
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Two Derivations of the Time-Dependent 
Coupled-Power Equations 


By S. D. PERSONICK 
(Manuscript received April 12, 1974) 


In this paper, the time-dependent coupled-power equations originally 
derived by Marcuse from intuitive arguments are rederived two ways, one 
using the coupled-line equations with perturbation theory and the other 
using the Kronecker product approach of Rowe and Young.* 


I. INTRODUCTION 


Suppose a multimode fiber guide with random mode coupling is 
excited by an optical source at one end at time t = 0. 

This input excitation will produce, at time t, a response of optical 
power in each mode ν at a position z from the input end given by 
s_ν(t, z). 

Marcuse² has suggested from intuitive arguments that s_ν(t, z) should 
satisfy the following differential equation 

$$\frac{\partial}{\partial z}\langle s_\nu(t,z)\rangle + \frac{1}{C_\nu}\frac{\partial}{\partial t}\langle s_\nu(t,z)\rangle = \sum_\mu |K_{\nu\mu}|^2 F \langle s_\mu(t,z)\rangle - \Big(\sum_\mu |K_{\nu\mu}|^2\Big) F \langle s_\nu(t,z)\rangle, \tag{1}$$

where K_νμ is the coupling coefficient in the coupled-line equations 
(described below) between modes ν and μ, F is the spectral height of 
the mechanical perturbation in the fiber geometry responsible for the 
coupling, ⟨s_ν(t, z)⟩ is the average‡ power in mode ν at position z, and 
C_ν is the group velocity in mode ν. 

In this paper, we derive eq. (1) from the coupled-line equations first 
using perturbation theory and then using the Kronecker product 
approach of Rowe and Young.³ 

The importance of this work is to show that the intuitive eq. (1) 
does in fact follow directly from (i) the coupled-line equations, (ii) 
voltage linearity of the guide (from Maxwell’s equations), and (iii) 
ensemble averaging over the random coupling perturbation. It is 
encouraging to note that skillfully framed power-flow arguments such 
as those used by Marcuse lead to the same results as the more cumbersome 
approaches that start from the coupled-line equations. 

It should be pointed out again that, however it is derived, eq. (1) 
describes the average of the flow of power for an ensemble of guides. 
How the flow of power in a particular guide compares to the average 
flow is still an open subject. 

† Subsequent to the writing of this paper, it became known to the author that many 
of the results included in the paper were independently and concurrently derived by 
R. Steinberg of Columbia University.¹ 

‡ Average for an ensemble of guides with similar gross properties and similar 
excitation. 


II. ANALYSIS 


We start with the coupled-line equations that describe the z evolution 
of 𝒜_ν(ω, z), the complex amplitude of the voltage in a mode ν at 
position z resulting from a single Fourier component of the optical 
excitation at frequency ω at position z = 0. (We can use Fourier 
components since the guide is linear in voltage.) 

$$\frac{\partial A_\nu(\omega,z)}{\partial z} = \sum_\mu A_\mu(\omega,z)\, K_{\nu\mu} f(z) \exp[i(\beta_\nu - \beta_\mu)z], \tag{2}$$

where 

𝒜_ν(ω, z) (the complex amplitude in mode ν) = A_ν(ω, z) exp[−iβ_ν(ω)z], 

β_ν = propagation constant for mode ν (which is a function of ω), and 

K_νμ f(z) = coupling coefficient between modes μ and ν (K_νμ = −K*_μν). 

If the input power excitation is a function of time, then the average 
power response at position z is given by the average of the square of the 
complex envelope, a_ν(z, t), at position z, i.e., 

$$\langle s_\nu(z,t)\rangle = \langle |a_\nu(z,t)|^2\rangle = \iint \langle \mathcal{A}_\nu(\omega+\sigma,z)\,\mathcal{A}_\nu^*(\omega,z)\rangle \exp[i(\omega+\sigma)t] \exp(-i\omega t)\, d\omega\, d(\omega+\sigma), \tag{3}$$

where the complex envelope a_ν(t, z) is the Fourier transform of 𝒜_ν(ω, z). 
We shall next derive eq. (1) by obtaining a differential equation for 
(∂/∂z)⟨s_ν(t, z)⟩. 


2.1 Perturbation theory approach 
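Following techniques used previously by Marcuse,² we first write 

$$\frac{\partial}{\partial z}\langle A_\nu(\omega+\sigma,z)A_\nu^*(\omega,z)\rangle = \Big\langle \Big[\frac{\partial}{\partial z}A_\nu(\omega+\sigma,z)\Big] A_\nu^*(\omega,z)\Big\rangle + \Big\langle A_\nu(\omega+\sigma,z)\Big[\frac{\partial}{\partial z}A_\nu^*(\omega,z)\Big]\Big\rangle. \tag{4}$$

Using (2) in (4) we obtain 

$$\begin{aligned}
\frac{\partial}{\partial z}\langle A_\nu(\omega+\sigma,z)A_\nu^*(\omega,z)\rangle
={}& \Big\langle \sum_\mu A_\mu(\omega+\sigma,z)\, K_{\nu\mu} f(z) \exp\{i[\beta_\nu(\omega+\sigma) - \beta_\mu(\omega+\sigma)]z\}\, A_\nu^*(\omega,z)\Big\rangle \\
&+ \Big\langle \sum_\mu A_\nu(\omega+\sigma,z)\, A_\mu^*(\omega,z)\, K_{\nu\mu}^* f(z) \exp\{-i[\beta_\nu(\omega) - \beta_\mu(\omega)]z\}\Big\rangle.
\end{aligned} \tag{5}$$

Next we recognize that the perturbation solution to (2) is given by 

$$A_\nu(z) = A_\nu(z') + \sum_\mu K_{\nu\mu} A_\mu(z') \int_{z'}^{z} f(x) \exp[i(\beta_\nu - \beta_\mu)x]\, dx. \tag{6}$$

We shall now substitute (6) into (5) with the following approximations: 
We assume f(x) is independent of A_ν(z′)A*_μ(z′) for x > z′. We 
keep only terms that are second order in K_νμ, because first-order terms 
would include the factor f(x) to first order and, since f(x) has been 
assumed independent of A_ν(z′)A*_μ(z′) for x > z′, the expectation of 
these terms would vanish [f(x) has zero mean]. 

We obtain 

$$\begin{aligned}
\frac{\partial}{\partial z}\langle A_\nu(\omega+\sigma,z)A_\nu^*(\omega,z)\rangle
={}& \sum_\mu\sum_\delta \langle A_\delta(\omega+\sigma,z')A_\nu^*(\omega,z')\rangle K_{\nu\mu}K_{\mu\delta}\int_{z'}^{z}\langle f(x)f(z)\rangle \\
&\cdot \exp\{i[\beta_\mu(\omega+\sigma) - \beta_\delta(\omega+\sigma)](x-z)\}\,dx \\
&\cdot \exp\{i[\beta_\nu(\omega+\sigma) - \beta_\delta(\omega+\sigma)]z\} \\
&+ \sum_\mu\sum_\delta \langle A_\mu(\omega+\sigma,z')A_\delta^*(\omega,z')\rangle K_{\nu\mu}K_{\nu\delta}^*\int_{z'}^{z}\langle f(x)f(z)\rangle \\
&\cdot \exp\{-i[\beta_\nu(\omega) - \beta_\delta(\omega)](x-z)\}\,dx \\
&\cdot \exp\{i[\beta_\nu(\omega+\sigma) - \beta_\mu(\omega+\sigma) - \beta_\nu(\omega) + \beta_\delta(\omega)]z\},
\end{aligned} \tag{7}$$

plus two similar terms where ω and ω + σ are interchanged and the 
conjugate is taken. 

In (7) we have again assumed independence between f(x) and 
A_ν(ω, z′) for x > z′. Now we assume that terms that are rapidly varying 
in z can be neglected, i.e., we neglect terms proportional to 
exp{i[β_μ(ω) − β_ν(ω)]z} unless μ = ν. We obtain for small z − z′ [so 
that ⟨A(ω + σ, z)A*(ω, z)⟩ ≈ ⟨A(ω + σ, z′)A*(ω, z′)⟩] 

$$\begin{aligned}
\frac{\partial}{\partial z}\langle A_\nu(\omega+\sigma,z)A_\nu^*(\omega,z)\rangle
={}& -\sum_\mu \langle A_\nu(\omega+\sigma,z)A_\nu^*(\omega,z)\rangle |K_{\nu\mu}|^2 F \\
&+ \sum_\mu \langle A_\mu(\omega+\sigma,z)A_\mu^*(\omega,z)\rangle |K_{\nu\mu}|^2 F \\
&\cdot \exp\{i[\beta_\nu(\omega+\sigma) - \beta_\mu(\omega+\sigma) - \beta_\nu(\omega) + \beta_\mu(\omega)]z\},
\end{aligned} \tag{8}$$

where F is the spectral height of the random process f(z), i.e., 

$$F = \int_{-\infty}^{\infty} \langle f(x)f(z)\rangle \exp\{i[\beta_\mu(\omega+\sigma) - \beta_\nu(\omega+\sigma)](x-z)\}\, d(x-z) \tag{9}$$

is assumed independent of [β_μ(ω + σ) − β_ν(ω + σ)], which implies 
⟨f(x)f(z)⟩ ≈ Fδ(x − z). 

Next we assume that each mode has a well-defined group velocity 
within the frequency band of interest, that is, 

$$\beta_\nu(\omega+\sigma) - \beta_\nu(\omega) = \frac{\sigma}{C_\nu}. \tag{10}$$

Substituting (10) into (8) we obtain, using (2), 

$$\begin{aligned}
\frac{\partial}{\partial z}\langle A_\nu(\omega+\sigma,z)A_\nu^*(\omega,z)\rangle
&= \frac{\partial}{\partial z}\big(\langle \mathcal{A}_\nu(\omega+\sigma,z)\mathcal{A}_\nu^*(\omega,z)\rangle \exp(i\sigma z/C_\nu)\big) \\
&= -\sum_\mu |K_{\nu\mu}|^2 F \langle \mathcal{A}_\nu(\omega+\sigma,z)\mathcal{A}_\nu^*(\omega,z)\rangle \exp(i\sigma z/C_\nu) \\
&\quad + \sum_\mu |K_{\nu\mu}|^2 F \langle \mathcal{A}_\mu(\omega+\sigma,z)\mathcal{A}_\mu^*(\omega,z)\rangle \exp(i\sigma z/C_\nu).
\end{aligned} \tag{11}$$

But 

$$\frac{\partial}{\partial z}\big(\langle \mathcal{A}_\nu(\omega+\sigma,z)\mathcal{A}_\nu^*(\omega,z)\rangle \exp(i\sigma z/C_\nu)\big) = \exp(i\sigma z/C_\nu)\Big[\frac{\partial}{\partial z}\langle \mathcal{A}_\nu(\omega+\sigma,z)\mathcal{A}_\nu^*(\omega,z)\rangle + \frac{i\sigma}{C_\nu}\langle \mathcal{A}_\nu(\omega+\sigma,z)\mathcal{A}_\nu^*(\omega,z)\rangle\Big]. \tag{12}$$

Substituting (12) into (11) and Fourier transforming as in (3), we 
obtain 

$$\frac{\partial}{\partial z}\langle s_\nu(t,z)\rangle + \frac{1}{C_\nu}\frac{\partial}{\partial t}\langle s_\nu(t,z)\rangle = -\sum_\mu |K_{\nu\mu}|^2 F \langle s_\nu(t,z)\rangle + \sum_\mu |K_{\nu\mu}|^2 F \langle s_\mu(t,z)\rangle, \tag{13}$$

which is the same as eq. (1). 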
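To see what eq. (13) predicts in the simplest setting, the sketch below integrates it in the continuous-wave limit, where the time derivative vanishes. That limit, the fourth-order Runge-Kutta integrator, and the names `rhs` and `propagate_cw` are assumptions made for illustration; `h[v][m]` stands for |K_νμ|²F.

```python
import math

def rhs(p, h):
    """Right side of eq. (13) with the time derivative set to zero."""
    n = len(p)
    return [sum(h[v][m] * (p[m] - p[v]) for m in range(n)) for v in range(n)]

def propagate_cw(p0, h, z_end, steps=1000):
    """Integrate d<s>/dz = rhs from z = 0 to z_end with classical RK4."""
    dz = z_end / steps
    p = list(p0)
    for _ in range(steps):
        k1 = rhs(p, h)
        k2 = rhs([x + 0.5 * dz * k for x, k in zip(p, k1)], h)
        k3 = rhs([x + 0.5 * dz * k for x, k in zip(p, k2)], h)
        k4 = rhs([x + dz * k for x, k in zip(p, k3)], h)
        p = [x + dz / 6 * (a + 2 * b + 2 * c + d)
             for x, a, b, c, d in zip(p, k1, k2, k3, k4)]
    return p

# Two modes, all power launched in mode 1.
h = [[0.0, 0.3], [0.3, 0.0]]
p_end = propagate_cw([1.0, 0.0], h, 2.0)
```

For two modes the power difference decays as exp(−2|K₁₂|²Fz) while the total average power is conserved, which is the equalizing behavior the coupled-power equations describe.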


2.2 Kronecker product approach 


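Rowe and Young³ assume two modes and write (in our notation) 

$$\begin{aligned}
\frac{\partial A_1(\omega,z)}{\partial z} &= K_{12} f(z) \exp[i(\beta_1 - \beta_2)z]\, A_2(\omega,z) \\
\frac{\partial A_2(\omega,z)}{\partial z} &= K_{21} f(z) \exp[i(\beta_2 - \beta_1)z]\, A_1(\omega,z),
\end{aligned} \tag{14}$$

where 

$$K_{12} = K_{21} = iK.$$

They then assume that f(z) is the derivative of an independent 
increments process satisfying 

$$\langle f(z)f(z')\rangle = F\delta(z - z') \quad \text{(Dirac delta)}.$$

They divide the fiber into intervals (sections) of length Δ and assume 
that all the coupling takes place at the right end of each interval. They 
obtain the following difference equation: 

$$\begin{aligned}
A_1(\omega, l\Delta) &= \cos(Kc_l)\, A_1[\omega,(l-1)\Delta] + i\sin(Kc_l)\, A_2[\omega,(l-1)\Delta] \exp[i(\beta_1 - \beta_2)l\Delta] \\
A_2(\omega, l\Delta) &= i\sin(Kc_l)\, A_1[\omega,(l-1)\Delta] \exp[i(\beta_2 - \beta_1)l\Delta] + \cos(Kc_l)\, A_2[\omega,(l-1)\Delta],
\end{aligned} \tag{15}$$

where 

$$c_l = \int_{\text{interval } l} f(z)\, dz.$$

From (15) we obtain 

$$\begin{aligned}
\langle A_1(\omega+\sigma, l\Delta)A_1^*(\omega, l\Delta)\rangle
={}& \langle\cos^2(Kc_l)\rangle \langle A_1[\omega+\sigma,(l-1)\Delta]A_1^*[\omega,(l-1)\Delta]\rangle \\
&+ \langle\sin^2(Kc_l)\rangle \langle A_2[\omega+\sigma,(l-1)\Delta]A_2^*[\omega,(l-1)\Delta]\rangle \\
&\cdot \exp\{i[\beta_1(\omega+\sigma) - \beta_1(\omega) - \beta_2(\omega+\sigma) + \beta_2(\omega)]z\}.
\end{aligned} \tag{16}$$

In (16) we used the fact that c_l is independent of A_1[ω, (l − 1)Δ] 
and A_2[ω, (l − 1)Δ] because f(z) has been assumed to be the derivative 
of an independent increments process. We also used the fact that 
terms like ⟨sin (Kc_l) cos (Kc_l)⟩ equal zero because c_l is a symmetrical 
random variable. 

In the limit as Δ → 0 (intervals {l} get small) we have 

$$\langle\cos^2(Kc_l)\rangle \to 1 - FK^2\Delta, \qquad \langle\sin^2(Kc_l)\rangle \to FK^2\Delta.$$

We obtain from (16) 

$$\begin{aligned}
\frac{\partial}{\partial z}\langle A_1(\omega+\sigma,z)A_1^*(\omega,z)\rangle
={}& -FK^2\langle A_1(\omega+\sigma,z)A_1^*(\omega,z)\rangle + FK^2\langle A_2(\omega+\sigma,z)A_2^*(\omega,z)\rangle \\
&\cdot \exp\{i[\beta_1(\omega+\sigma) - \beta_1(\omega) - \beta_2(\omega+\sigma) + \beta_2(\omega)]z\}, \\
\frac{\partial}{\partial z}\langle A_2(\omega+\sigma,z)A_2^*(\omega,z)\rangle
={}& -FK^2\langle A_2(\omega+\sigma,z)A_2^*(\omega,z)\rangle + FK^2\langle A_1(\omega+\sigma,z)A_1^*(\omega,z)\rangle \\
&\cdot \exp\{i[\beta_2(\omega+\sigma) - \beta_2(\omega) - \beta_1(\omega+\sigma) + \beta_1(\omega)]z\}.
\end{aligned} \tag{17}$$

This is the same as eq. (8) with K_12 = K_21 = iK and K_11 = K_22 = 0. 
Thus, the coupled-power equations follow from (17) by Fourier 
transforming and making use of approximation (10). 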
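The section model of eq. (15) lends itself to a small Monte Carlo check at a single frequency. In the sketch below the increments c_l are drawn as Gaussian with variance FΔ, which is one process consistent with ⟨f(z)f(z′)⟩ = Fδ(z − z′) but is an assumption not stated in the text; the function names and parameter values are likewise illustrative.

```python
import cmath
import math
import random

def simulate_guide(k, f_height, delta, sections, dbeta, rng):
    """Apply eq. (15) section by section to one guide; returns (|A1|^2, |A2|^2)."""
    a1, a2 = 1.0 + 0.0j, 0.0 + 0.0j            # all power launched in mode 1
    sigma = math.sqrt(f_height * delta)        # std of c_l for Gaussian increments (assumed)
    for l in range(1, sections + 1):
        c = rng.gauss(0.0, sigma)
        phase = cmath.exp(1j * dbeta * l * delta)   # exp[i(beta1 - beta2) l Delta]
        cs, sn = math.cos(k * c), math.sin(k * c)
        a1, a2 = (cs * a1 + 1j * sn * a2 * phase,
                  1j * sn * a1 / phase + cs * a2)
    return abs(a1) ** 2, abs(a2) ** 2

def ensemble_power(k, f_height, z, sections, dbeta, guides, seed=0):
    """Average power in mode 1 at position z over an ensemble of guides."""
    rng = random.Random(seed)
    delta = z / sections
    total = 0.0
    for _ in range(guides):
        p1, _p2 = simulate_guide(k, f_height, delta, sections, dbeta, rng)
        total += p1
    return total / guides
```

Each guide conserves total power exactly, while the ensemble average of the mode-1 power approaches (1 + exp(−2FK²z))/2, the continuous-wave solution of the two-mode coupled-power equations that follow from (17).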
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Effect of Junction Capacitance on the Rise 
Time of LED’s and on the Turn-on Delay 
of Injection Lasers 


By T. P. LEE 
(Manuscript received April 29, 1974) 


The space-charge capacitance of the forward-biased junction has been 
found to play a major role in (i) the apparent rise time of the emission 
from small-area, high-radiance LED’s and (ii) the apparent turn-on 
delay of stripe-geometry DH-structure laser diodes. For a zero-bias 
capacitance of 200 pF, a typical value for such devices made by oxide-masking 
techniques, the measured rise time of an LED that is fully turned 
on and the turn-on delay time of injection lasers may be as much as twice 
the limitation imposed by the spontaneous carrier recombination time. A 
proposed method to reduce these delays by preshaping the driving pulse is 
analyzed, and a reduction of the delay by a factor of 2 or better is predicted. 
These results are in agreement with experiments. 


I. INTRODUCTION 


Among the optical sources presently available for optical communica- 
tion applications, AlGaAs light-emitting diodes!” and injection lasers*-* 
are probably the most compatible with low-loss optical fibers.*.® The 
possibility of direct pulse modulation of the optical output of these 
devices by varying the driving current is a major advantage, and the 
intrinsic radiative recombination time suggests that very high modula- 
tion rates are possible. Fractional-nanosecond rise times and modula- 
tion rates to 100 Mb/s for LEp’s’-" and modulation rates to hundreds 
of Mb/s for injection lasers!!!” have been reported. Faster rise times 
and higher modulation rates are more difficult to attain. 

For an ideal LED, if the injected carriers arrive instantaneously at 
the recombination (diffusion) region, the rise time of the spontaneous 
emission is governed solely by the spontaneous recombination time of 
the carriers. However, in driving a practical diode, the junction 
capacitance and the stray capacitance cause delay in the arrival time of the 
injected carriers at the recombination region. Thus, the rise time of the 
spontaneous emission in an LED and the delay time of the stimulated 
emission in a laser diode would be either (i) material-limited by the 
spontaneous recombination time τ_s or (ii) circuit-limited by the time 
constant τ_c of the driving circuit (including the junction capacitance 
of the diode). The space-charge capacitance effect has been reported 
previously in large-area GaAs LED’s¹⁸ and in GaAs₀.₆P₀.₄ LED’s.¹⁹ 

In a previous report, Dawson and Burrus” measured the rise time 
of small-area GaAs LED’s.!? They showed that the rise time decreased 
with the capacitance of the unit. For instance, a GaAs diode with a 
junction capacitance of 200 pF had a rise time of 5.5 ns; when the 
capacitance was reduced to 20 pF by etching away the excess area of 
the junction, the rise time was reduced to 3.5 ns. They also observed 
that the rise time decreased with driving current. More recently, in 
doing the modulation experiment for LED’s, Dawson employed a low- 
impedance driver to improve the speed of modulation. A rate of 48 
Mb/s”! with approximately 1-ns rise time was obtained using a driver 
with an output impedance of 2 to 4 ohms. At the higher rate of 280 
Mb/s” a snap-diode driver with an effective impedance of 1 to 2 ohms 
was used to produce a train of short pulses. The apparent rise time of 
the observed optical pulses was 0.7 ns. However, the peak power was 
less than that for dc drive, since the LED was not fully turned on in
this case. Also, White and Burrus? used a low output-impedance tran- 
sistor driver that produced a light pulse with 2.5-ns rise time. 

In view of the previous experimental results, the present work studies 
the junction capacitance effect in detail and correlates theory with 
experiments. The effect on the turn-on delay of injection lasers is 
studied similarly. The expected reduction of such delays by means of 
a preshaped driving pulse is calculated. 

In practical applications for high modulation rates, a combination 
of the following techniques can be used: (i) constructing diodes to have
low capacitance, (ii) employing drivers with low output impedance
(to match the diode impedance), and (iii) properly shaping the driving
current pulse.

One additional point in the results of this work is that, in determining
the spontaneous recombination lifetime by the measurement of
laser turn-on delay,²³ good results can be expected if (i) the junction
capacitance is less than about 10 pF and (ii) the laser is operating in
a single filament.²⁴,²⁵ The latter requirement is necessary because the
distribution of current among filaments and the determination of the
threshold become ambiguous in diodes lasing in multiple filaments.


II. DESCRIPTION OF THE DIODES AND THE EQUIVALENT CIRCUIT


The LED used in the study was a Zn-diffused n-type GaAs
(N_d > 3 × 10¹⁸ cm⁻³) diode made by C. A. Burrus.!? In this structure, the
p-n junction is formed over the entire wafer of about 1.44 × 10⁻³ cm²
area, and the SiO2 layer masks most of the p⁺ surface except for a
small window where the contact is made to the junction. The current
is restricted to flow largely in the small contact region, resulting in a
small emitting area of 2 to 3 × 10⁻⁵ cm². In our investigation of the
rise time, the current density in the primary emission region ranged
from 5 kA/cm² to 80 kA/cm².

The laser diode we used was a GaAs-AlxGa1-xAs double-heterostructure
device with a stripe contact.?® This geometry forces the current to
flow only in a narrow region, thereby confining the laser action to the
small part of the junction below the contact. The wafer had an area of
about 2.5 × 10⁻³ cm², whereas the stripe contact area was about
1.25 × 10⁻⁴ cm² (500 μm × 25 μm). Room-temperature threshold
current density was about 3.5 kA/cm².

The common feature of both diodes is that the large junction over 
the entire wafer presents a space-charge capacitance comparable to 
the diffusion capacitance of the junction. 

Figure 1 shows the equivalent circuit for study of the transient 
behavior of both diodes. A nonlinear resistor represents the I-V
characteristic of the diode, described by

    i_d = I_0 [exp(q v_d / n k T) − 1],    (1)

where v_d is the junction voltage. The factor n is unity for an ideal diode
diffusion current but, in our heavily doped p-n junction in GaAs, n
is approximately 2. In the transient case, part of the injected carriers
will replenish the carriers that recombine, and the rest will build up
as stored charges. Thus, the stored charges will increase as (1 − e^(−t/τ_s)),





Fig. 1—The diode equivalent circuit. 
where the time constant τ_s is the carrier spontaneous recombination
time. This is equivalent to a capacitance, C_d, whose charge, q_d(t), is
equal to the stored charge and whose instantaneous charge-voltage
relationship satisfies

    q_d = Q_0 [exp(q v_d / n k T) − 1],    (2)

where v_d is the instantaneous junction voltage, and Q_0 = I_0 τ_s. To
satisfy eq. (2), the diffusion capacitance is defined as

    C_d = dq_d/dv_d = τ_s di_d/dv_d.    (3)

In addition to the diffusion capacitance, there is the space-charge
capacitance of the depletion layer. The space-charge capacitance, in
parallel with C_d, is given by the usual form

    C_s = (A − A_e) c_0 / (1 − v_d0/φ)^m + A_e c_0 / (1 − v_d/φ)^m,    (4)

where c_0 is the zero-bias capacitance per unit area, φ is the barrier
voltage, which is 1.26 volts from C-V measurement, the exponent m
is 1/2, A is the total area, and A_e is the emitting (contact) area. At small
forward voltages, the current spreads over the entire area of the diode
chip. The current confinement in the contact area is obtained only at
a voltage v_d0, at which the junction resistance is less than the spreading
resistance. Thus, the first term in eq. (4) accounts for the space-charge
capacitance of the junction surrounding the emitting area, while the
second term is the space-charge capacitance of the emitting area itself.
Since A ≫ A_e and v_d ≈ v_d0, eq. (4) can be approximated by

    C_s ≈ C_0 / (1 − v_d0/φ)^m,    (5)

where C_0 = A c_0 is the total zero-bias capacitance.

Referring to Fig. 1 again, resistance R_s in the equivalent circuit
accounts for the series resistance of the bulk material as well as the
contact resistances. The driver output impedance is represented by
R_g, and V_g is a rectangular voltage pulse amplitude. For completeness,
the following circuit equations are included:

    i = (V_g − v_d) / (R_g + R_s)    (6)
    i_c = i − i_d    (7)
    dv_d/dt = i_c / (C_s + C_d).    (8)

Table I—Constants for diode parameters

         I_0           n*   m     R_s     R_g    C_0
LED      2.3 × 10-”    2    0.5   5.6 Ω   50 Ω   150-250 pF
Laser    1.2 × 10-*    2    0.5   0.6 Ω   50 Ω   130-200 pF

* For ideal diodes, n = 1 for the diffusion current at low injection and n = 2 at
high injection. In our GaAs with heavily doped n-region (N_d > 3 × 10¹⁸ cm⁻³), n
is usually equal to 2.


Numerical solutions to the system of eqs. (1) through (8) have been
obtained using the experimentally determined constants for both LED’s
and laser diodes, as tabulated in Table I. Since the spontaneous emission
is proportional to the diffusion current, the time dependence of
emission can be found from the time response of i_d.
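
As an illustration of how such a numerical solution might be set up, the following is a minimal sketch, not the program used in this work. It assumes the equivalent circuit reduces to a single state variable (the junction voltage), takes the space-charge capacitance as the constant given by the approximate form of eq. (5), and integrates with a simple forward-Euler step. The values chosen for the saturation current (1e-11 A), the confinement voltage (1.0 V), the drive amplitude (6.8 V), and the recombination time (1.4 ns) are illustrative assumptions; the function names are hypothetical.

```python
import math

def simulate_turn_on(c0, v_g=6.8, r_total=55.6, i0=1e-11, n=2.0,
                     tau_s=1.4e-9, phi=1.26, m=0.5, v_d0=1.0,
                     t_end=40e-9, dt=1e-12):
    """Integrate the junction voltage for a step drive and return (times, i_d).

    c0 is the zero-bias capacitance in farads. i0, v_d0 and v_g are assumed
    illustrative values, not measured constants.
    """
    v_t = 0.02585                           # thermal voltage kT/q near 300 K, volts
    c_s = c0 / (1.0 - v_d0 / phi) ** m      # space-charge capacitance, approximate form
    v = 0.0                                 # junction voltage, starts uncharged
    times, currents = [], []
    for k in range(int(round(t_end / dt))):
        e = math.exp(v / (n * v_t))
        i_d = i0 * (e - 1.0)                # diode (diffusion) current
        c_d = tau_s * i0 * e / (n * v_t)    # diffusion capacitance, tau_s * di_d/dv_d
        i = (v_g - v) / r_total             # current delivered by the driver
        times.append(k * dt)
        currents.append(i_d)
        v += dt * (i - i_d) / (c_s + c_d)   # charging current divided by total capacitance
    return times, currents

def delay_and_rise(times, currents):
    """Return (delay to the 10% point, 10%-to-90% rise time) of i_d."""
    final = currents[-1]
    t10 = next(t for t, i in zip(times, currents) if i >= 0.1 * final)
    t90 = next(t for t, i in zip(times, currents) if i >= 0.9 * final)
    return t10, t90 - t10

if __name__ == "__main__":
    for c0 in (20e-12, 200e-12):
        delay, rise = delay_and_rise(*simulate_turn_on(c0))
        print(f"C0 = {c0 * 1e12:.0f} pF: delay = {delay * 1e9:.2f} ns, "
              f"rise = {rise * 1e9:.2f} ns")
```

Under these assumptions the larger zero-bias capacitance gives both a longer delay and a longer rise time, which is the qualitative trend discussed in Section IV; the absolute numbers depend on the assumed constants.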


III. EXPERIMENTAL SETUP


The diode was mounted at the end of a 50-ohm microstrip line (in 
series with either a 50-ohm resistor for the laser diode or a 47-ohm 
resistor for the LED chosen experimentally to minimize the reflections), 
and was driven by a Tektronix* 110 pulse generator that produced a
rectangular pulse with a rise time less than 250 picoseconds and ripple 
below 1 percent. The pulse width used was 20 ns, and the repetition 
rate was 300 p/s. The output was detected by a p-i-n photodiode 
with a rise time better than 150 picoseconds and displayed on a 90-pico- 
second rise-time sampling oscilloscope. 


IV. NUMERICAL RESULTS AND COMPARISON WITH EXPERIMENTS 
4.1 Characteristics of LED response 


The diffusion current, as a function of time, resulting from a step 
input voltage to the LED can be obtained by solving eqs. (1) to (8). 
Using the measured diode parameters given in Table I, we obtain the 
dependence of the rise time, between the 10- and 90-percent points of 
the light pulse, and the delay time between the application of the 
current pulse and the 10-percent point of the light pulse. These quanti- 
ties are functions of driving current density and the junction capaci- 
tance. Such dependence for various values of zero-bias space-charge 
capacitance is shown in Fig. 2a. It is seen that, for very large current 
densities, the rise time is unaffected by C_0 and approaches an asymptotic
value of 2τ_s, slightly faster than an ideal exponential rise of the
form (1 − e^(−t/τ_s)). The delay time is largely due to the space-charge
capacitance as clearly shown in Fig. 2b, and it approaches zero at very 


* Trade name of Tektronix Corporation. 
Fig. 2—(a) Rise time of LED output. (b) Delay time of LED output as a function of
current density for various zero-bias diode capacitances.


large current densities. This is understandable since, for very large 
current densities, the space-charge would be completely neutralized. 
Such neutralization would require a current density on the order of 


    J ≈ q N_d W_0 A / (τ_s A_e),    (9)

where W_0 is the space-charge width at zero bias, N_d is the net donor
density, A is the total junction area, A_e is the light-emitting area where
the current flows, and W_0 = ε_0 ε_r A/C_0 = 0.66 × 10⁻⁵ cm (for A = 1.2
× 10⁻³ cm², C_0 = 200 pF, and ε_r = 12.8). If we use N_d = 3 × 10¹⁸ cm⁻³
and τ_s = 1.4 × 10⁻⁹ s, we see the estimated current density is 9 × 10⁴
A/cm², a value in good agreement with the curves of Figs. 2a and 2b.

Fig. 3—Comparison of calculated total rise time (t_d + t_r) with measured values.
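
The arithmetic behind this estimate can be checked with a few lines of code. This is only a sketch of the calculation in eq. (9): the emitting area of 3 × 10⁻⁵ cm² is an assumption (the upper end of the 2 to 3 × 10⁻⁵ cm² range quoted in Section II), and the function names are hypothetical.

```python
Q = 1.602e-19      # electron charge, coulombs
EPS0 = 8.854e-14   # permittivity of free space, F/cm

def zero_bias_width(area_cm2, c0_farad, eps_r=12.8):
    """Space-charge width at zero bias, W_0 = eps_0 * eps_r * A / C_0, in cm."""
    return EPS0 * eps_r * area_cm2 / c0_farad

def neutralizing_current_density(n_d, w0, tau_s, area, emit_area):
    """Current density of eq. (9), J = q N_d W_0 A / (tau_s A_e), in A/cm^2."""
    return Q * n_d * w0 * area / (tau_s * emit_area)

w0 = zero_bias_width(1.2e-3, 200e-12)
# emit_area = 3e-5 cm^2 is an assumed value within the quoted range
j = neutralizing_current_density(3e18, w0, 1.4e-9, 1.2e-3, 3e-5)
print(f"W0 = {w0:.2e} cm, J = {j:.2e} A/cm^2")
```

With these inputs the width comes out near 0.7 × 10⁻⁵ cm and the current density near 9 × 10⁴ A/cm², in line with the values quoted above.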

The experimentally measured total rise time (t_d + t_r) indeed showed
a current dependence similar to that calculated above. The spontaneous
recombination time τ_s can be determined by noting that, at
large current densities, the rise time approaches 2τ_s, independent of
the junction capacitance. Figure 3 shows the experimentally observed
current-dependent total rise time normalized to τ_s plotted against the
current density. It is clearly seen that the experimental points can be
matched to a calculated curve if C_0 is assumed slightly larger than 200
pF. This value of C_0 is in very good agreement with the measured value.


4.2 Laser turn-on delay 


Below lasing threshold, the accumulation of injected carriers in a 
laser is similar to that in an LED. Thus, the spontaneous emission from 
a laser has a similar response to the emission from an LED. When the 
current increases to a level at which the laser has gain equal to the total 
loss of the cavity and mirrors, laser action commences and the light 
output increases sharply. The stimulated emission reduces the spon- 
taneous lifetime considerably, so that the rise time of the stimulated 
emission is much shorter than that of the spontaneous emission. Thus, 
the delay time of the stimulated emission is just the response time 
required for the diode current to reach the lasing threshold. This delay 
time can be determined as a function of I/(I − I_th) by calculating the
response of the diffusion current for experimental diode parameters and 
by determining experimentally the threshold current of the laser. 

Curves of delay vs current above threshold for various values of
zero-bias diode capacitance are shown in Fig. 4. For C_0 = 0, the
relationship is a straight line given by the usual delay-time formula²³

    t_d = τ_s ln [I / (I − I_th)].    (10)

It is clear that the space-charge capacitance will further delay the laser
turn-on time. For large C_0, the relationship deviates from the straight
line, a phenomenon observed in past experiments* that has not been
explained adequately. For C_0 = 200 pF, a typical value for the experimental
diode, the delay time is about twice that for zero capacitance.
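
For reference, the zero-capacitance relation of eq. (10) and its inverse (used when a recombination time is extracted from a measured delay) can be written as a short sketch. The function names are hypothetical, and the code covers only the C_0 = 0 case; it does not include the capacitance correction discussed in the text.

```python
import math

def laser_turn_on_delay(current, threshold, tau_s):
    """Eq. (10): t_d = tau_s * ln(I / (I - I_th)), valid for I > I_th and C_0 = 0."""
    if current <= threshold:
        raise ValueError("drive current must exceed the threshold current")
    return tau_s * math.log(current / (current - threshold))

def tau_from_delay(delay, current, threshold):
    """Invert eq. (10) to obtain tau_s from a measured delay (no capacitance correction)."""
    if current <= threshold:
        raise ValueError("drive current must exceed the threshold current")
    return delay / math.log(current / (current - threshold))

# Example: delay in units of tau_s at twice threshold
print(laser_turn_on_delay(2.0, 1.0, 1.0))
```

Because the text reports that a 200-pF junction roughly doubles the delay, applying `tau_from_delay` directly to such a device would overestimate τ_s by about the same factor.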

To determine τ_s experimentally from delay measurements, therefore,
a correction for the effects of junction capacitance must be made.
Using C_0 = 200 pF, the corrected τ_s would be 1.8 × 10⁻⁹ s for our particular
laser diode. The measured delay time t_d, when normalized to
τ_s as a function of I/(I − I_th), agrees well with the results calculated
in this way.


4.3 Effect of driver output impedance 


It is obvious that the resistance-capacitance time constant of the 
charging current can be shortened by either reducing the space-charge 


*In past experiments, the result that the straight line did not pass the origin was 
attributed to filamentary lasing or multimoding (Ref. 24). Perhaps the effect of the 
capacitance is also contributing to the undesirable results (Ref. 25). 
Fig. 4—Laser turn-on delay (normalized to the spontaneous recombination time
τ_s) as a function of excitation currents above threshold for various zero-bias diode
capacitances. Solid lines are calculated.


capacitance* or lowering the output impedance of the driver.*!'” Using 
the same diodes, we varied the impedance of the driver by shunting the 
diode with a low resistance. Figure 5 shows a typical result for a laser
diode. The turn-on delay time of the laser as a function of the shunt 
resistance for various driving currents and the luminescence rise time 
when the laser diode was operated below threshold were measured. To 
assure that the same amount of current was injected into the laser 
as the shunt resistance was changed, the light output was monitored 
first for 50 ohms (no shunt). For other shunt resistances, the driving 
current was increased until the same output was obtained. For a given 
diode current, the delay was shortened by lowering the driver impedance.
The reduction in delay became smaller as the diode current
increased. This is in line with the results indicated in Section 4.1 that
the effect of space-charge capacitance decreased at higher currents. With 


*J. C. Dyment (Ref. 25) has observed independently that proton-bombarded 
stripe-geometry lasers with shallow penetration have delay times almost 1.5 times 
that of units with deep penetration. The unit with shallow penetration had a capaci- 
tance of about 75 pF, while those with deep penetration had 15- to 20-pF capacitance. 




a 1-ohm shunt resistance, the capacitance effect should be negligible.
The results in Section 4.1 showed that the luminescence rise time would
be about 2.2τ_s. Thus, we can deduce from the rise time (4.1 × 10⁻⁹ s)
that the spontaneous recombination time τ_s = 1.85 × 10⁻⁹ s. Again,
this agreed with the delay time measurement mentioned in Section 4.2.


V. EQUALIZATION OF LASER TURN-ON DELAY 


As discussed above, the accumulation of charges in the recombination
region is approximately exponential with an effective time constant
τ_eff. This time constant may be larger than or equal to the spontaneous
lifetime τ_s, depending on whether the space-charge capacitance
is significant. To reduce the delay from τ_s, a driving current
with amplitude many times higher than I_th would be necessary, and
this would require a driver with high power capability if its output
impedance is larger than that of the diode laser, which is about a few
ohms. An alternative is to preshape the current pulse to equalize the
delay. One such pulse shape, the exponential decay shown in Fig.
6a, has been investigated.

Using the exponentially decaying pulse, I = I_a e^(−t/τ_c) + I_0, shown
in Fig. 6a as the driving current pulse, and using the same approach as


[Figure: laser output delay time and luminescent output rise time (time in nanoseconds) vs. shunt resistance in ohms; curves labeled 3 I_th, 2 I_th, 1.5 I_th, 1.2 I_th, and 0.43 I_th.]


Fig. 5—Laser output delay time and luminescent rise time as a function of the shunt 
resistance for various excitation currents. 


Fig. 6—(a) Current waveform for delay reduction of laser diodes. (b) The pulse 
shaping network used in the experiment. 


in Ref. 23, the laser turn-on delay can be derived as 


    (I_0/I_th)(1 − e^(−T_d)) + (I_a/I_th)[T_c/(T_c − 1)](e^(−T_d/T_c) − e^(−T_d)) = 1,    (11)

where I_a is the amplitude of the exponential portion of the current
pulse, I_0 is the constant portion of the current pulse, and I_th is the
threshold current with a rectangular current pulse. T_d = t_d/τ_eff is the
normalized delay time, which eq. (11) determines implicitly, and
T_c = τ_c/τ_eff. Since the parameters I_a and
T_c are independent, we considered two cases: (i) Set τ_c/τ_eff = 1, and
find the decrease of t_d with I_a. (ii) Set I_a = I_0 and find the decrease of
t_d with τ_c. The results are shown in Figs. 7a and 7b, respectively. In
Fig. 7a, for each I_0, the curves start on the Y-axis at a value of t_d/τ_eff
that equals the normalized delay for a rectangular pulse given by (10).
As I_a increases, the delay decreases and approaches 0.1 τ_eff at I_a = 8
I_th. This implies that a sharp spike in the driving current pulse would
help considerably in reducing the delay.

Figure 7b shows the reduction of t_d as a function of T_c (= τ_c/τ_eff).
For a very small value of T_c, the exponential portion of the current is
too fast for the charge to respond, and the reduction of t_d is marginal.
For T_c ≫ 1, t_d asymptotically approaches 0.287 τ_eff at I_a = I_0 = 2 I_th.


[Figure: t_d/τ_eff vs. I_a/I_th; curve label I_0/I_th = 1.5.]


Fig. 7a—Reduction of delay time as a function of the peak pulse amplitude. 
Fig. 7b—Reduction of delay time as a function of the pulse decaying time constant.


At T_c = 1, t_d = 0.316 τ_eff. Thus, a factor of 2.2 reduction in t_d can be
realized with this shaped pulse compared to that obtainable with a
rectangular pulse.
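
The quoted delays can be reproduced with a small numerical sketch. It assumes, as a simplification consistent with the discussion above, that the stored charge follows a first-order build-up with time constant τ_eff under the drive I = I_a e^(−t/τ_c) + I_0, and that lasing starts when the charge reaches I_th τ_eff. All quantities are normalized (currents to I_th, times to τ_eff); the function names are hypothetical.

```python
import math

def normalized_charge(t, i_a, i_0, t_c):
    """Stored charge divided by I_th * tau_eff at normalized time t.

    Closed-form solution of dq/dt = I(t) - q for I(t) = i_a * exp(-t / t_c) + i_0.
    """
    steady = i_0 * (1.0 - math.exp(-t))
    if math.isclose(t_c, 1.0):
        spike = i_a * t * math.exp(-t)          # limiting form when tau_c = tau_eff
    else:
        spike = i_a * t_c / (t_c - 1.0) * (math.exp(-t / t_c) - math.exp(-t))
    return steady + spike

def turn_on_delay(i_a, i_0, t_c, t_max=20.0, step=1e-3):
    """First normalized time at which the charge reaches threshold, or None."""
    for k in range(int(t_max / step)):
        lo, hi = k * step, (k + 1) * step
        if normalized_charge(hi, i_a, i_0, t_c) >= 1.0:
            for _ in range(60):                 # bisection inside the bracketing step
                mid = 0.5 * (lo + hi)
                if normalized_charge(mid, i_a, i_0, t_c) >= 1.0:
                    hi = mid
                else:
                    lo = mid
            return hi
    return None                                 # threshold never reached within t_max

shaped = turn_on_delay(2.0, 2.0, 1.0)           # I_a = I_0 = 2 I_th, T_c = 1
rectangular = turn_on_delay(0.0, 2.0, 1.0)      # no spike, I = 2 I_th
print(shaped, rectangular, rectangular / shaped)
```

Under these assumptions the shaped pulse gives a delay close to 0.32 τ_eff and the rectangular pulse gives ln 2 ≈ 0.69 τ_eff, a ratio of about 2.2; for a very slowly decaying spike the delay tends to ln(4/3) ≈ 0.29 τ_eff.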

Experimentally, we have realized the pulse shape in Fig. 6a by 
shaping a rectangular pulse with the network shown in Fig. 6b as an 
example. This pulse shape not only serves to reduce delay, but its 
negative after-pulse serves to turn the laser off faster than a rectangular 
pulse. The resistors R_1 and R_2 are necessary for matching to our 50-ohm
measuring system. The capacitor C_1 may be chosen to provide different
time constants. Table II summarizes the experimental and calculated
results, and it can be seen that the agreement is excellent.


VI. CONCLUSIONS


Both theoretical and experimental investigations of the effect of 
junction capacitance on the rise time of LED’s and the delay time of 
Table II—Comparison of experimental and calculated delay reduction

Rectangular pulse, I/I_th = 2      Exponential pulse, I_a = I_0 = 2 I_th

τ_eff   t_d (I = 2 I_th)   τ_c    R_1   R_2   C_1    t_d, exp.   t_d, calc.
(ns)    (ns)               (ns)   (Ω)   (Ω)   (pF)   (ns)        (ns)

3.8     2.4                0.25   100   82    5      2.3         2.2
                           0.5    100   82    10     1.6         [illegible]
                           1.95   100   82    39     1.2         1.3
                           3.9    100   82    79     1.0         1.1


injection lasers lead to the conclusion that the apparent response delays 
in these devices can be attributed largely to the shunting capacitance 
of the junction itself. The effects are greatest in devices operating at 
relatively low currents. Thus, in the more efficient double-heterostruc- 
ture devices, the response time is much larger than would be expected 
from the spontaneous recombination time of the carriers. Homostruc- 
ture devices, normally operated at much higher current densities, are 
affected less.?7 

Practically, the effects can be minimized by constructing devices to 
have low capacitance (by isolation of the active junction through the 
use of mesa structures or proton bombardment, for example) and by 
providing the lowest possible impedances in the driving circuits. 
Furthermore, in certain systems applications where short delay is 
required, a simple delay equalization (in the form of a shaped driving 
pulse) can be employed in conjunction with a driver that has low- 
output impedance. 

It should be pointed out, however, that our definition of rise time is 
the time required for the light output to reach 90 percent of its fully 
developed steady-state value, when the diode is driven by a step cur- 
rent. In many practical applications, the LED can be modulated at 
much higher speed than that imposed by the spontaneous recombina- 
tion time.?® In such cases, the light output would not reach the steady- 
state value before the driving pulse is off, resulting in an apparently 
narrower output pulse but smaller amplitude. The price one pays, of 
course, is the reduced driving efficiency.” 

In addition to the effects of the capacitance on the pulse modulation 
of these devices, it is also pointed out that great care must be exercised 
in determining the semiconductor carrier recombination time by mea- 
surement of the delay time of junction lasers. To assure good results, 
the junction capacitance must be less than 10 pF, and the device must
be operating in a single filament. 
A Depletion-Load, p-Channel, Bipolar-IGFET
Technology 


By G. MARR and G. L. MOWERY 
(Manuscript received April 25, 1974) 


A high-performance, depletion-load, bipolar-IGFET technology is
described. The optimization of device and circuit parameters, the
ion-implanted depletion-load fabrication process, and the high-speed input
and output circuits, which allow direct interface with the TTL circuit
family, are discussed.


I. INTRODUCTION 


Since the development of the low-threshold (V_T = −1 volt) metal-gate
p-channel Al2O3-SiO2 IGFET technology,¹ further technological
advances have been made to improve circuit speed and interface flexibility
of IGFET circuits. Two significant improvements have been the
incorporation of bipolar-IGFET (BIGFET)² devices on the same monolithic
silicon-integrated circuit (SIC) and the application of ion implantation
to fabricate depletion-load IGFET’s.³ These have resulted in
an IGFET technology whose circuits are completely compatible with
the standard 5-volt bipolar TTL family in terms of input and output
levels as well as power supply. Furthermore, the relatively high functional
packing density of the IGFET technology makes it an extremely
attractive option in MSI/LSI applications. These have been successfully
realized in the design and fabrication of several Bell System catalog
and custom circuits.

In this paper, the technology characteristics of the depletion-load 
BIGFET technology are described. 


II. DEPLETION-LOAD INVERTER
2.1 Circuit characteristics 

The advantages of the depletion-load inverter over a similar all- 
enhancement or resistor-load inverter become obvious when we con- 
sider the various load-line characteristics. In Fig. 1, the load lines of 
a resistive load, an enhancement-IGFET load, and a depletion-IGFET
load are shown over the family of I-V curves for the enhancement-IGFET
driver (Q_D). The three load lines are chosen to intercept at the
same point in the V_in = V_DD and V_in = 0 characteristic curves to
reflect the fact that they will produce the same output dc levels. The
area under the load line represents the transient power dissipated
during one logic transition. For a given capacitive loading on the inverter
output node, the transient power dissipation is proportional to
the frequency of operation. Therefore, the area under the load line is
directly proportional to the maximum operating frequency of the
inverter. The speed advantage of the depletion load is apparent because
it has the largest area between V_out = V_ON and V_out = V_OFF.

Fig. 1—Inverter load-line characteristics.


2.2 Circuit equations 


In this section, the approximate circuit equations of the depletion- 
load inverter are reviewed. In spite of their inexactness, they are useful 
because they provide insight into the functional dependence of circuit 
parameters on device parameters. However, the final choice of device 
parameters, discussed in Section 2.3, was based on more exact com- 
puter calculations. 

Assuming the gradual-channel approximation, the output levels
(V_ON and V_OFF) and the corresponding single-stage noise margins (V_M1
and V_M2) of a depletion-load inverter with Q_D on and off, respectively,
as shown in Fig. 2, may be given by?

    V_ON ≈ V_TL² / [2 β_R (V_DD − V_TD)]    (1)
    V_OFF = V_DD    (2)
    V_M1 = |V_G1 − V_ON|    (3)
    V_M2 = |V_G2 − V_OFF|,    (4)

where

    V_G1 = [...] / (1 + β_R)^[...] + V_TD    (5)
    V_G2 = [...] + V_TD    (6)

and where

    V_TD, V_TL = threshold voltages of Q_D and Q_L, respectively,
    V_DD = supply voltage,
    β_D, β_L, β_R = IGFET gain parameters of driver and load and their ratio.

Fig. 2—Depletion load inverter.



The propagation delay (t_d) is directly related to the inverter-output
rise time (t_r) and fall time (t_f). Since the rise-time charging current is
given by I_r = (β_L/2)·V_TL², t_r may be approximated as

    t_r ≈ 2 C V_DD / (β_L V_TL²),    (8)

where C is the output capacitance and, neglecting interconnection
capacitance, may be expressed in terms of β_D as

    C = (β_D l_D² / μ)(1 + F),    (9)

where l_D is the channel length of Q_D, μ is the mobility of carriers in
the channel, and F is the number of similar inverter gates to be driven
(fan-out). If interconnection capacitance is expressed as equivalent
fan-out, eq. (9) is still applicable. Substituting (9) into (8) results in

    t_r ≈ 2 V_DD β_R l_D² (1 + F) / (μ V_TL²).    (10)



Likewise, since the fall-time discharging current is

    I_f = (β_D/2)(V_DD − V_TD)² − (β_L/2) V_TL²,

the fall time of the output voltage may be approximated as

    t_f ≈ 2 V_DD l_D² (1 + F) / {μ [(V_DD − V_TD)² − V_TL²/β_R]}.    (11)

Since t_d is proportional to (t_r + t_f), we observe from (10) and (11) that
t_d is directly proportional to l_D², F, and V_DD. A more important
observation, perhaps, is that as V_TL²/β_R increases, t_r decreases and t_f
increases. Thus, depletion-load inverters designed for optimal speed
have t_r and t_f nearly equal.

A final useful relationship is the power-delay product of the inverter.
Using the average power for P_D for small V_TL²/β_R,

    P_D t_d ≈ P_D t_r ≈ (V_DD I_r)(V_DD C / I_r) ≈ V_DD² C_D (1 + F) ≈ V_DD² β_D l_D² (1 + F) / μ,

where C_D is the gate capacitance of Q_D, and P_D t_d is completely
independent of V_TL. However, for larger values of V_TL²/β_R,

    P_D t_d ≈ P_D t_f ≈ V_DD² β_D l_D² (1 + F) / {μ [β_R (V_DD − V_TD)² / V_TL² − 1]},

and P_D t_d increases quadratically as |V_TL| increases until |V_TL|
≈ √β_R |V_DD − V_TD|.

As we stated earlier, the preceding approximate equations provide
insight into the dependence of circuit parameters on device parameters
for a depletion-load IGFET inverter. This is appropriate for characterizing
the properties of any depletion-load IGFET random logic circuit,
normally comprising an assortment of n-input NAND and NOR gates,
because both these logic gates can be analytically reduced to an
equivalent depletion-load IGFET inverter.
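
The trade-off between rise and fall time can be illustrated with a short sketch that takes the approximate rise time as 2 V_DD β_R l_D² (1 + F)/(μ V_TL²) and the approximate fall time as 2 V_DD l_D² (1 + F)/{μ [(V_DD − V_TD)² − V_TL²/β_R]}, using voltage magnitudes. The channel length (10 μm) and mobility (200 cm²/V·s) below are hypothetical values chosen only to give plausible numbers, and the function names are not from the original work. Setting the two expressions equal gives |V_TL| = |V_DD − V_TD| √(β_R/2), which holds only within these approximate equations.

```python
import math

def rise_time(v_dd, v_tl, beta_r, l_d, mu, fan_out):
    """Approximate rise time: 2 V_DD beta_R l_D^2 (1 + F) / (mu V_TL^2)."""
    return 2.0 * v_dd * beta_r * l_d**2 * (1 + fan_out) / (mu * v_tl**2)

def fall_time(v_dd, v_td, v_tl, beta_r, l_d, mu, fan_out):
    """Approximate fall time; the driver must out-pull the depletion load."""
    drive = (v_dd - v_td) ** 2 - v_tl**2 / beta_r
    if drive <= 0:
        raise ValueError("load current exceeds driver current; output cannot fall")
    return 2.0 * v_dd * l_d**2 * (1 + fan_out) / (mu * drive)

def balanced_v_tl(v_dd, v_td, beta_r):
    """|V_TL| at which the two approximate expressions give t_r = t_f."""
    return (v_dd - v_td) * math.sqrt(beta_r / 2.0)

# Voltage magnitudes in volts; l_d in cm and mu in cm^2/V.s are assumed values
V_DD, V_TD, BETA_R, L_D, MU, F = 5.0, 1.0, 5.0, 1e-3, 200.0, 5
for v_tl in (3.0, 4.5):
    t_r = rise_time(V_DD, v_tl, BETA_R, L_D, MU, F)
    t_f = fall_time(V_DD, V_TD, v_tl, BETA_R, L_D, MU, F)
    print(f"|V_TL| = {v_tl} V: t_r = {t_r * 1e9:.1f} ns, t_f = {t_f * 1e9:.1f} ns")
```

Raising |V_TL| shortens the rise time and lengthens the fall time, as stated above. The more exact computer calculations described in Section 2.3 are what the final design choice rests on, so the balance point from this sketch should be read only as a rough guide.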


2.3 Selection of V_TL and β_R


In this section, the method of determining the parameters of the
depletion-load IGFET and the corresponding β_R is discussed. Since
the equations developed in Section 2.2 are only approximate, more exact
calculations were made using the circuit analysis program SERVICE.®
In this way, the voltage dependence of V_TL was properly accounted for.

The objectives and constraints of the technology were chosen to be: 


(i) Single-supply voltage V_DD = −5 ± 0.5 volt.
(ii) V_TD = −1 ± 0.4 volt at end of life.
(iii) |V_ON| ≤ |V_TD|.
(iv) V_M1 and V_M2 ≥ 0.5 volt.


The problem is then, simply: Given these conditions, determine β_R
and V_TL such that t_d is nearly a minimum for an assumed typical fan-out
of five from each inverter. The net result is a family of t_d vs. V_TL
curves. Since V_ON, V_M1, and V_M2 are functions of V_TL, a different β_R,
the minimum value consistent with conditions (iii) and (iv), was
chosen for every value of V_TL used. A nominal t_d vs. V_TL curve is
shown in Fig. 3, where the P_D t_d product is also plotted against the
same V_TL scale. As can be seen from the curve, the value of t_d reaches
a nearly minimum constant value for V_TL ≈ 4.5 volts, when t_r ≈ t_f.
We may further note that the P_D t_d product increases approximately
quadratically as V_TL increases. Based on this type of plot of all
combinations for different limits of V_DD and V_TD, an optimal V_TL = 4.5
± 0.5 volt was chosen, corresponding to a minimum design β_R of 5.

Fig. 3—t_d and P_D t_d vs. V_TL curve.

Fig. 4—Depletion-load inverter transfer curve.
[Figure label: V_TL = +4.5 V.]

Figure 4 shows a calculated and a measured dc transfer curve. The
good agreement between the two is an indication of the validity of the 
present work. 


III. DEPLETION-LOAD FABRICATION AND CHARACTERISTICS


A proven method of fabricating depletion IGFET’s is the use of a
selective ion implant through the gate dielectric in the IGFET channel
regions. To achieve good control of the device properties, it is desirable
to have the majority of the implanted ions in the silicon rather than
in the gate dielectric, which requires 120 keV for the boron ions used.
The resulting device no longer behaves exactly like the normal IGFET
in which conduction takes place only in the surface inversion. It is
actually a deep depletion IGFET with a subsurface channel, whose conductivity
is modulated both by surface depletion and by the channel
to substrate junction as shown in Fig. 5. The current-voltage characteristics
of such a deep depletion IGFET have been previously described.*

Fig. 5—Depletion IGFET cross-sectional view.

Fig. 6—Effective threshold vs. implant dose.
[Figure: effective threshold V_T′ in volts; boron at 120 keV, assumed β = 5.25 μmho/volt.]

To achieve a reproducible profile of the ion-implanted dopants in 
the silicon, it is desirable to minimize the duration and extent of high 
temperature anneal treatment after the ion implantation. Since all 
the diffusion processes take place before the gate oxidation, the latter 
is the last high temperature step. Therefore, ion implantation is per- 
formed after the gate oxidation process. The photomask step for the 
boron implantation utilizes a thick photoresist layer (1.4 μm) that is
opaque to the 120-keV boron ions. Furthermore, since the required
implantation dose of 7.6 × 10¹¹ cm⁻² is rather low, no damage to the
gate dielectric is either expected or observed in the fabricated samples.
The needed anneal® of implanted species is achieved in a subsequent
process step when the masking SiO2* is deposited over the whole wafer
at 900°C for 20 minutes.

Figure 6 shows the effective threshold V_T′ = √(2I/β) as a function
of the implantation dose of boron at 120 keV. The current is measured
when gate and source are common and the drain is biased at −5 volts.
We observe that the V_T′-Q curve is essentially linear over a fairly
wide range of dose (2 × 10¹¹ to 9 × 10¹¹ cm⁻²).


* The metal-gate IGFET (Ref. 1) has a double layer Al2O3-SiO2 for its gate
dielectric. The deposited oxide is needed to serve as a mask when Al2O3 is
selectively etched.
IV. INTERFACE CIRCUITS


To allow realization of the speed advantages offered by the depletion- 
load and BIGFET technologies, two buffer circuit configurations have 
been developed to facilitate efficient interface between inputs and 
outputs of BIGFET SIC’s as well as to inputs and outputs of TTL(L)
circuits. These two circuits are the level-shifting input circuit and the 
low-impedance output driver. 


4.1 Input interface 


It is necessary that the input voltage to an IGFET inverter be within 
one device threshold of substrate potential to assure complete turn-off 
of that inverter. Since this requirement is not met for the 1-volt thresh- 
old p-channel technology by either TTL or BIGFET output levels, the 
input level shifting circuit has been designed to accept these logical 1 
levels. The schematic of the input circuit is shown in Fig. 7. The
corresponding dc transfer curves are shown in Fig. 8. The internal feedback
configuration of the level shifter produces a very sharp gain
characteristic that lowers the required level for logic 1 and also provides
increased noise margin and input signal wave shaping. In the design of
this circuit, both the β ratio of Q1 to Q2 and the β ratio of Q3 to Q4
strongly affect the acceptable input voltage range. The β ratio of Q1
to Q3 is not critical, but may be chosen to produce minimum propagation
delay of signals through the input level shifter in a specific
application.

A further constraint on the β values of Q1 and Q2 is the fan-out
desired from TTL(L) outputs and from BIGFET outputs into this level
shifting input. During a 1 to 0 transition of the input voltage, IGFET’s
Q1 and Q2 conduct current from the substrate to the input node. The 
requirement of conducting this current to ground while maintaining 
a valid 0 level at the input determines the fan-out from a given output 
to the level shifting input circuits. Maximum 0 level Vin to the level 


[Fig. 7 schematic: transistors Q1 through Q4 between the input (IN), the output (TO CIRCUIT), and +Vss, with β(Q1) = 12 µmhos/V, β(Q2) = 12 µmhos/V, β(Q3) = 6 µmhos/V, β(Q4) = 30 µmhos/V.]


Fig. 7—Inverting level shifting input circuit. 
[Fig. 8 plot: level-shifted V_IN to circuit (volts) versus V_IN (volts), at 0°C and 85°C, for V_SS = +5.0 V, V_DD = 0.0 V, V_T = −1.0 V, V_TL = +4.5 V.]


Fig. 8—Inverting level shifter transfer curves. 


shifting input inverter is Vss − 3.4 volts, which is 1.1 volts for Vss = 4.5
volts. Allowing 0.4-volt interface noise margin requires that the input
voltage be <0.7 volt for the 0 level. The TTL(L) output is rated at 28
mA to ground at 0.7 volt. The level shifting input inverter of Fig. 7 can
conduct β × 4.2 µA during a 1 to 0 transition, where β = β(Q1) or β(Q2)
is expressed in micromhos per volt. Therefore, the fan-out from TTL(L)
to level shifting inputs is fan-out = 6667/β, where β = β(Q1) or β(Q2).
Similarly, the BIGFET output with maximum output resistance of 2.1
kilohms can conduct 0.333 mA at 0.7 volt. Therefore, fan-out from
BIGFET outputs to level shifting inputs is fan-out = 79/β, where
β = β(Q1) or β(Q2).
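
The fan-out arithmetic above can be checked with a short sketch. The function name `fan_out` and the variable names are illustrative and do not appear in the original; the numerical values are those quoted in the text, and β is taken in micromhos per volt.

```python
def fan_out(drive_current_ua, beta_umho_per_v, sink_ua_per_beta=4.2):
    """Number of level-shifting inputs that one output can pull to a valid 0 level.

    Each input conducts beta * 4.2 microamperes during a 1 to 0 transition,
    so the fan-out is the available drive current divided by that figure.
    """
    return drive_current_ua / (sink_ua_per_beta * beta_umho_per_v)


V_SS = 4.5                            # volts
v_in0_max = V_SS - 3.4                # maximum 0 level at the input (about 1.1 V)
v_in0_allowed = v_in0_max - 0.4       # after the 0.4 V noise margin (about 0.7 V)

ttl_drive_ua = 28_000.0               # TTL(L) output: 28 mA to ground at 0.7 V
bigfet_drive_ua = v_in0_allowed / 2100.0 * 1e6   # 2.1 kilohm output at about 0.7 V

beta = 12.0                           # beta(Q1) = beta(Q2) from Fig. 7
print("TTL(L) fan-out:", fan_out(ttl_drive_ua, beta))
print("BIGFET fan-out:", fan_out(bigfet_drive_ua, beta))
```

Setting `beta` to 1 recovers the constants 6667 and 79 quoted in the text, to within rounding.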

The circuit of Fig. 7 has been used in various depletion-load circuits 
for both custom and general-purpose applications. It provides 


              Min.          Max.

Vin 0         0.0           Vss − 3.4 volts
Vin 1         Vss − 2.2     Vss volts.


4.2 Output interface 


The BIGFET output circuit configuration is shown in Fig. 9. This 
circuit provides an ideal coupling between the high-impedance IGFET’s 
and the desired low-output impedance. The current gain provided by 
[Fig. 9 schematic: supply +Vss; β(Q1) = 30 µmhos/V, β(Q2) = 3 µmhos/V, β(Q4) = 20 µmhos/V; Q3 BIGFET emitter area 0.5 mil², Q5 BIGFET emitter area 2.4 mil²; Rout = 1.45 kΩ; output node Vout.]





Fig. 9—BIGFET SIC output circuit.


the Darlington BIGFET structure permits the gate capacitance of Q1
to be as small as that normally used internally in an IGFET circuit. Since
average power dissipation of the output circuit is proportional to
(Rout)^-1, it is desirable to make Rout large. But the maximum allowed
value of Rout is determined by the input characteristics of TTL(L),
−240 µA at 0.4-volt input logical 0. Consequently, Rout is nominally 1.45
kilohms, and the other device values are as shown in Fig. 9 for this BIGFET
output configuration with fan-out = 1 capability to TTL(L) circuits.
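
As a rough check on the nominal value, assume that Rout alone must sink the TTL(L) input current to ground while holding the logical 0 level at or below 0.4 volt. This is an assumption about the circuit of Fig. 9, which is not reproduced here. Under that assumption the upper bound on Rout is approximately

```latex
R_{\text{out,max}} \approx \frac{0.4\ \text{V}}{240\ \mu\text{A}} \approx 1.67\ \text{k}\Omega
```

so the nominal 1.45 kilohms leaves some margin below this bound.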

The value of Rout, the V_BE voltage drops of Q3 and Q5 operating as
a Darlington pair, and the BIGFET emitter area of Q5 are designed to 
prevent Q5 from entering the saturation region, which would sig- 
nificantly increase the turn-off time of Q5. 

The circuit of Fig. 9 has been used in various depletion-load circuits 
for both custom and general-purpose applications. It provides 


              Min.          Max.
Vout 0        0.0           0.4 volt
Vout 1        Vss − 1.7     Vss − 1.2 volts.


V. SUMMARY 


The p-channel depletion-load BIGFET process has been demonstrated
to be a successful technology for producing BIGFET SIC’s. Circuit
behavior has been approximately described using simplified circuit
equations, and exact circuit behavior has been characterized using 
results from computer circuit analysis programs. Good agreement has 
been achieved between experimental results and circuit simulation. 
The application of ion implantation to the fabrication of depletion 
mode IGFET load devices has been described, and a correlation is shown 
between the effective threshold voltage and implant dose. Finally, the 
input and output buffer circuits, which allow interface between BIGFET
SIC’s and TTL(L) circuits, are presented.
A Speaker-Independent Digit-Recognition
System 


By M. R. SAMBUR and L. R. RABINER 
(Manuscript received July 9, 1974) 


This paper describes an implementation of a speaker-independent digit- 
recognition system. The digit classification scheme is based on segmenting 
the unknown word into three regions and then making categorical judg- 
ments as to which of six broad acoustic classes each segment falls into. The 
measurements made on the speech waveform include energy, zero cross- 
ings, two-pole linear predictive coding analysis, and normalized error of 
the linear predictive coding analysis. A formal evaluation of the system
showed an error rate of 2.7 percent for a carefully controlled recording
environment and a 5.6 percent error rate for on-line recordings in a noisy
computer room.


I. INTRODUCTION


With the widespread growth in the use of digital computers, there 
has been an increasing need for man to be able to communicate with 
machines in a manner more naturally suited to humans. The realization 
of this need has motivated a great deal of research in automatic recog- 
nition of speech by computer.!-* Although only a moderate degree of 
success has been obtained in solving the problems associated with 
machine recognition of continuous speech,’ a greater degree of success 
has been obtained in recognition of isolated words from a fixed vocabulary.
The performance of these systems ranges from about 92 percent
correct decisions for 561 isolated words by an individual for whom the
system has been carefully trained® to nearly error-free performance for
the recognition of a limited vocabulary (e.g., the digits) also spoken
by a speaker for whom the system has been trained. However, performance
of many of these word-recognition algorithms is radically
degraded when the system has not been tuned to the speech character- 
istics of the individual user. The subject of this paper is an isolated- 
word, digit-recognition system that achieves high accuracy without 
having to be trained every time a different speaker wishes to use the 
system. 
The development of a speaker-independent limited-vocabulary word
recognizer is inherently more difficult than a speaker-adaptive system 
that can use comparatively simple pattern-matching algorithms to 
recognize the input words. It has been argued that the extended effort 
needed to design a speaker-independent system is unnecessary in view 
of the relative ease of training an adaptive scheme to learn to recognize 
the speech of a new user. There are two major reasons why such argu- 
ments are invalid. For small vocabulary systems (e.g., digit recog- 
nizers) with a large number of potential users, it is not feasible to store 
training data for every possible user. Furthermore, most systems can- 
not train themselves on new speakers very rapidly. Thus, the turn- 
around time for new users is often a major factor limiting the use of 
speaker-dependent systems. For a large vocabulary (250 words), the 
time required for a new speaker to form reference patterns for all the 
words in the vocabulary can be prohibitive. In addition, the variation 
with time of a speaker’s voice characteristics may necessitate frequent 
updating of his reference patterns. Finally, the design of a speaker- 
adaptive word-recognition algorithm is so dependent on the uniqueness 
of each talker that very little insight is gained in the actual problem of 
recognizing speech. On the other hand, it is hoped that the development 
of a speaker-independent scheme will contribute to an understanding 
of the acoustic attributes of speech that reliably distinguish the various 
sounds. Without such an understanding, it would be difficult to dupli- 
cate the human capacity of recognizing the speech of a wide variety 
of speakers. 

This paper discusses a speaker-independent digit-recognition system 
that was implemented on the computer facility of the acoustics research 
department at Bell Laboratories. Section II discusses the basic speech 
parameters that are measured and shows how the digits can be classi- 
fied from these features in a speaker-independent manner. This section 
includes a discussion of the various signal-processing techniques that 
are heavily relied on throughout the classification process. Section III 
discusses the digit-classification scheme. The classification procedure 
is a tree-like decision algorithm for which backwards tracing is allowed 
when one of the parallel-decision algorithms indicates a high prob- 
ability of error. Section IV gives the results of a formal evaluation of 
the recognition system. Finally, the paper concludes with a discussion 
of the strong and weak points of the system and suggestions for how it 
can be improved. 


II. FRAMEWORK OF THE RECOGNITION SYSTEM


Figure 1 is a block diagram of the overall digit recognition system 
that was implemented. Following endpoint alignment in which the 
Fig. 1—Block diagram of the overall digit recognition system.






interval containing the word to be recognized is carefully determined, 
the speech is analyzed every 10 ms to obtain zero-crossing rate, energy, 
two-pole model linear-predictive-coding (LPC) coefficients, and the
residual LPC estimation error. To aid in making preliminary classification
decisions, the speech interval is segmented into three well-defined
regions. All the speech information is fed in parallel into a preliminary 
decision-making algorithm that chooses one of several possible digit 
classes for the input utterance—e.g., one class contains the digits 1 
and 9. A final decision is then made based on the presence or absence 
of certain key features in the input speech. 

In this section, we show how the various digits can be characterized 
in terms of certain acoustic features. Then we discuss some key signal- 
processing functions that are heavily relied on in the decision algo- 
rithms and that contribute strongly to making the system speaker- 
independent. 


2.1 Characterization of the digits 


The elemental speech units (phonemes) that comprise English words 
can be classified into two broad categories, vowels and consonants. 
The vowels can be further classified into front (/i/, /I/, /e/, /ɛ/, and
/æ/), middle (/ɝ/, /ʌ/), and back vowels (/u/, /U/, /o/, and /ɔ/). It
is also convenient to subdivide the consonants into the categories 
noise-like (fricatives, plosives) and vowel-like (nasals, glides). Table I 
gives a list of the sequence of phoneme categories for each of the ten 
Table I—Sound classes characteristic of the digits (from Ref. 6)


Digit   Sequence of Sound Classes

0       VNLC → FV → VLC → BV
1       VLC → MV → VLC
2       UVNLC → FV → BV
3       UVNLC → VLC → FV
4       UVNLC → BV → MV
5       UVNLC → MV → FV → VNLC
6       UVNLC → FV → UVNLC
7       UVNLC → FV → VNLC → FV → VLC
8       FV → UVNLC
9       VLC → MV → FV → VLC


VNLC = Voiced, noise-like consonant. 
UVNLC = Unvoiced, noise-like consonant. 
VLC = Vowel-like consonant. 
FV = Front vowel. 
MV = Middle vowel. 
BV = Back vowel. 


digits, 0 through 9.⁶ Our approach toward speaker-independent
recognition of the digits is to use a set of robust measurements to 
classify the phonemes into the six broad categories listed in Table I. 
By robust measurements, we mean acoustic parameters that give a 
general indication of the gross nature of each phoneme without being 
too dependent on the speaker’s voice characteristics. Through a com- 
bination of parallel processing and self-normalization, the phoneme 
categories are determined and the spoken digit is recognized. We now 
discuss the criteria for the selection of the robust measurements that 
are used, the technique of self-normalization of measurements, and 
finally the method of parallel processing of the data to give a
speaker-independent classification of the digits.


2.2 Robust measurements for digit recognition 


The requirements for a recognition parameter to be selected as being 
a robust measurement are: 


(i) The parameter can be simply and unambiguously measured.
(ii) The parameter can be used to grossly characterize a large
proportion of speech sounds.
(iii) The parameter can be conveniently interpreted in a
speaker-independent manner.


Based on the above requirements, the zero-crossing-rate and spectral- 
energy parameters are excellent candidates for robust measurements. 
These parameters can be used to effectively characterize the general 
acoustic properties of the sound categories listed in Table I. For ex- 
ample, noise-like sounds have a relatively high zero-crossing rate, 
relatively low energy, and a relatively high concentration of high-frequency
energy. Thus, the noise-like sounds of any speaker can be
characterized quite accurately based on these measurements. The term
“relatively,” in the above classification of noise-like sounds, can be
conveniently interpreted for a given speaker by a simple self-normalization
technique discussed later in this paper.

To measure the distribution of spectral energy, a two-pole LPC
analysis has been suggested by Makhoul and Wolf⁷ as an excellent
means of representing the gross features of the spectrum. Figure 2
(from Makhoul and Wolf⁷) shows the results of applying a two-pole
model to a variety of speech sounds. This figure compares the
spectra of several speech sounds obtained directly from FFT spectrum
measurements with the spectra of the best two-pole LPC
fit to the spectrum. For a two-pole LPC analysis, there is either one
complex-conjugate pole pair or two real poles. In Fig. 2a, the spectra for
the sound /sh/ as in the word “short” are plotted. For this example,
the two-pole LPC analysis gives a complex conjugate pole at about 3000


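[Fig. 2 panels: spectra for /sh/, /a/, a voice bar, and /i/; relative energy in dB versus frequency in kHz, with computed pole frequencies marked (3007 Hz, 0 Hz, 0 Hz, 25 Hz, 2510 Hz in the extracted labels).]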


Fig. 2—Comparison of FFT spectra and two-pole LPC spectra for several speech
sounds.
Hz, i.e., the region of maximum energy concentration in the spectrum.
In Fig. 2b, similar results are shown for the vowel /a/ where the major 
concentration of energy in the spectrum is around 800 Hz. In the ex- 
amples of Figs. 2c and 2d (a voice bar and the vowel /i/), the major 
concentration of spectral energy is around 0 Hz; thus, the two-pole 
LPC analysis gives two real poles in the right-half z-plane. From Fig. 2, 
it can be seen that the computed pole frequency gives a good indication 
of the location of the dominant portion of the spectral energy of the 
sound and can thus be effectively used to characterize sounds with 
relatively high-frequency or low-frequency concentrations of energy. 
For example, noise-like sounds are characterized by a relatively high- 
frequency spectral concentration of energy, while nasals and vowels 
generally have a much lower frequency for the energy concentration. 

Figure 3 (also from Makhoul and Wolf⁷) illustrates the dynamic
behavior of the computed pole frequency of the two-pole model and
the corresponding spectrogram of the utterance, “Has anyone measured
nickel concentrations ...” Examination of Figs. 2 and 3 shows that,
for vowel-like sounds, the computed pole frequency is invariably
situated somewhere between the first and second formants. In general,
when F1 and F2 are not too far apart and have comparable amplitudes,
the pole frequency falls almost midway between the two resonances.
Since F1 does not usually have as much dynamic movement as F2, the
computed pole frequency tends to follow the motion of the second
formant. Since the motion of F2 is quite important in the characterization
of the digits, the ability of the two-pole LPC analysis to track this
motion has been used in the classification phase of the digit-recognition
system.

Fig. 3—Spectrogram and computed pole-frequency of two-pole LPC model for the
utterance, “Has anyone measured nickel concentrations ...”

It should be noted, however, that when either F1 and F2 are sufficiently
far apart or the amplitude of F1 is significantly greater than
the amplitude of F2, the pole frequency will either follow F1 or result
in two positive real poles. Figure 3 shows that, during the /i/ in the
word “anyone,” the pole frequency begins to dip sharply as the separation
between F1 and F2 grows greater and finally results in positive
real axis poles as the separation reaches some critical threshold. For
nasal sounds, the energy is so highly concentrated near the first resonance
that the two-pole model usually results in positive real axis
poles, as seen in Fig. 3.

In addition to using the computed pole frequency as a measure of 
the location of dominant spectral energy and a characterization of the 
dynamic movement of F2, the normalized error of the two-pole model 
contains important information about the spread of spectral energy. 
The normalized or residual error is defined as 


    V = 1 - a_1 r_1 - a_2 r_2,


where a_1 and a_2 are the two-pole LPC coefficients and r_1 and r_2 are the
normalized autocorrelation coefficients. It can be shown that the more 
concentrated the energy spectrum, the lower the normalized error.® 
For speech sounds, the relative magnitude of the normalized error 
generally increases from sonorants to vowels and then to fricatives. 
Within the three vowel types, the back vowels have the lowest relative 
normalized error and the front vowels have the highest. By observing 
the relative changes in the pole frequency and normalized error, im- 
portant information about the structure of the voiced region of the 
word can be obtained. An example of the usefulness of the pole param- 
eter in specifying the speech sounds comprising the digits is given in 
the next section. 

In summary, a reasonable set of robust measurements that have 
been implemented for this digit recognition algorithm is as follows: 


(i) Zero crossing rate, which is defined as the number of zero crossings
in a fixed frame length (on the order of 10 ms).

(ii) Energy, which is defined as the sum of the squared values of the
speech waveform in a given frame.

(iii) Normalized error obtained from a two-pole LPC analysis of a
given speech frame.

(iv) Pole frequency (or frequencies) obtained from a two-pole LPC
analysis of a given speech frame.
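
The four measurements listed above can be sketched for a single frame as follows. This is a minimal illustration, not the authors' implementation: the function name `frame_measurements`, the use of the autocorrelation method without a window, the silent-frame convention, and the choice to report the angle of the dominant root as the pole frequency are all assumptions made here.

```python
import numpy as np


def frame_measurements(frame, fs=10_000.0):
    """Return ZCR, energy, two-pole LPC normalized error, and pole frequency."""
    x = np.asarray(frame, dtype=float)

    # (i) Zero-crossing count: sign changes between adjacent samples.
    neg = np.signbit(x)
    zcr = int(np.count_nonzero(neg[1:] != neg[:-1]))

    # (ii) Energy: sum of the squared sample values.
    energy = float(np.dot(x, x))
    if energy == 0.0:
        # Silent frame: nothing to predict (this convention is an assumption).
        return {"zcr": zcr, "energy": 0.0, "norm_error": 1.0, "pole_hz": 0.0}

    # Normalized autocorrelation coefficients at lags 1 and 2.
    r1 = float(np.dot(x[:-1], x[1:])) / energy
    r2 = float(np.dot(x[:-2], x[2:])) / energy

    # Two-pole normal equations: [[1, r1], [r1, 1]] [a1, a2]^T = [r1, r2]^T.
    denom = max(1.0 - r1 * r1, 1e-12)   # guard against a singular system
    a1 = r1 * (1.0 - r2) / denom
    a2 = (r2 - r1 * r1) / denom

    # (iii) Normalized error V = 1 - a1*r1 - a2*r2.
    norm_error = 1.0 - a1 * r1 - a2 * r2

    # (iv) Poles are the roots of z^2 - a1*z - a2. A complex pair gives a
    # nonzero frequency; positive real poles give a zero-hertz frequency.
    roots = np.roots([1.0, -a1, -a2])
    dominant = roots[np.argmax(np.abs(roots))]
    pole_hz = float(abs(np.angle(dominant)) * fs / (2.0 * np.pi))

    return {"zcr": zcr, "energy": energy,
            "norm_error": norm_error, "pole_hz": pole_hz}
```

For a 10-ms frame at a 10-kHz sampling rate, `frame` would hold 100 samples. A narrowband input such as a single sinusoid should give a small normalized error and a pole frequency near the sinusoid frequency.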
2.3 Self-normalization of parameters


Almost all classification algorithms use some set of threshold levels 
in the decision process. Using a fixed set of thresholds leads to a large 
number of problems for speech recognition in that many of the thresh- 
olds are speaker- or time-dependent. To eliminate this difficulty, the 
technique of self-normalization of parameters was used in which many 
of the most significant thresholds were obtained from measurements 
made directly on the speech sample being recognized. Thus, for ex- 
ample, in the case of setting thresholds on zero-crossing rate to deter- 
mine whether a sound is noise-like or nasal, a statistical description of 
the zero-crossing rate (ZCR) was made for the entire utterance. The
statistical description consisted of measuring the mean of the ZCR and
its standard deviation over the region of strong energy (i.e., the region
where the energy exceeded 10 percent of the maximum energy of the
utterance). Based on ZCR measurements, one criterion for classifying
a segment as noise-like was that its ZCR exceeded a level one standard
deviation above the mean during the segment. Figure 4 shows the ZCR
measurements for the word “seven.” Indicated in this figure are the
average ZCR and a range of one standard deviation around this average.
During the initial /s/, the ZCR is significantly above the threshold, as
Fig. 4—Energy and ZCR for one example of the word “seven.”

Fig. 5—Energy and ZCR for one example of the word “nine.”


In much the same manner, a self-normalized ZCR threshold can be
set for classifying a segment as a nasal-like sound. In such cases, the
ZCR generally falls below a level one standard deviation below the average
ZCR for the utterance. As an example, Fig. 5 shows the measured
ZCR for the word “nine” and gives the spread of ZCR around the average
value. For the initial and final nasals, the ZCR is much lower than the
average and thus is a good indication of the nasals.
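
The self-normalized ZCR thresholds described above can be sketched as follows. The function name `zcr_labels` and the three label strings are illustrative, not taken from the original; the 10-percent energy criterion and the one-standard-deviation levels follow the text.

```python
import numpy as np


def zcr_labels(zcr, energy, energy_fraction=0.10):
    """Label each frame from thresholds measured on the utterance itself."""
    zcr = np.asarray(zcr, dtype=float)
    energy = np.asarray(energy, dtype=float)

    # Region of strong energy: frames above 10 percent of the maximum energy.
    strong = energy > energy_fraction * energy.max()
    if not strong.any():
        raise ValueError("utterance has no frames with nonzero energy")

    # Mean and standard deviation of the ZCR over the strong-energy region.
    mean = zcr[strong].mean()
    std = zcr[strong].std()

    labels = np.full(zcr.shape, "neutral", dtype=object)
    labels[zcr > mean + std] = "noise-like"   # one std above the mean
    labels[zcr < mean - std] = "nasal-like"   # one std below the mean
    return labels, mean, std
```

Because the thresholds come from the utterance being recognized, no speaker-specific constants are needed in this step.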

The idea of determining thresholds based on measurements made 
during the course of the utterance being recognized can be used for any 
or all measurements described in the preceding section. For example, 
Fig. 6 shows the two-pole model normalized error and the pole frequency
for the word “nine.” The nasal sections are clearly characterized
by low normalized error and a zero-hertz pole frequency. In
contrast, Fig. 7 shows the same measurements for the word “six.”
Again, the noise-like sections are clearly depicted by the relatively
high values of normalized error, pole frequency, and ZCR.

The transitional nature of the normalized error and pole frequency
can be used to classify the vowels into types front, middle, and back.
Figure 8 shows the normalized error and pole frequency throughout
the word “two.” After the frication region, which is marked by high
normalized error and low energy, the normalized error uniformly
Fig. 6—Complete set of measurements for one example of the word “nine.”


decreases. The decrease in the normalized error is due to the fact that 
the vowel nature changes from front (because of the /t/) to back. Thus, 
without specifying any absolute thresholds, the constituent structure 
of the voiced section of the word can be obtained by noting the relative 
changes in the normalized error. As described earlier, changes in the 
pole frequency can also be used to indicate the constituent vocalic 
structure. As seen in Fig. 8, the pole frequency is continually decreasing 
throughout the voiced region in the word “two,” thereby indicating 
a continually decreasing second formant. As another example, Fig. 9 
shows the parameters for the word “four.” The gradually increasing 
normalized error and pole frequency are indicative of a progression 
from a back vowel to a middle vowel. 
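
The use of relative changes rather than absolute thresholds can be sketched with a simple trend test on the normalized error over the voiced region. The least-squares slope, the tolerance `flat_tol`, and the function names are assumptions introduced for illustration; the text does not say how the trend was measured.

```python
import numpy as np


def trend_slope(values):
    """Least-squares slope of a parameter track, in units per frame."""
    v = np.asarray(values, dtype=float)
    return float(np.polyfit(np.arange(len(v)), v, 1)[0])


def vowel_progression(norm_error, flat_tol=1e-3):
    """Guess the vowel progression from the normalized-error trend."""
    slope = trend_slope(norm_error)
    if slope > flat_tol:
        return "back -> middle/front"   # rising error, as in "four"
    if slope < -flat_tol:
        return "front -> back"          # falling error, as in "two"
    return "steady"
```

The same slope test could be applied to the pole-frequency track to follow the motion of the second formant.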


2.4 Parallel processing 


Using the self-normalization technique, each robust measurement 
can, by itself, classify a speech sound into one of the six broad categories 
Fig. 7—Complete set of measurements for one example of the word “six.”


of interest. Unfortunately, the classification will not be error-free; but 
if the results of all the measurements are “intelligently” pooled together,
then the classification performance can be significantly en-
hanced. The operation of combining the measurements is termed paral- 
lel processing. Parallel-processing ideas have met with good success in 
other areas of speech processing.® 

The idea of parallel processing as it is used here not only involves 
a suitable combination of the robust measurements but also the in- 
corporation of certain structural constraints of the lexicon as addi- 
tional input. For example, as seen in Fig. 9, the initial section of the 
word shows only a slight indication of the initial fricative /f/ (i.e., the 
high zero crossing and normalized error for the initial 10 to 20 ms of the 
Fig. 8—Complete set of measurements for one example of the word “two.”


word), and one might conclude that there is no frication. However, for 
the digits, it is known that only 1, 9, and 8 do not normally begin with 
frication. Since the ZCR, normalized error, and pole frequency are relatively
too high for the digits 1 or 9, these digits can be safely omitted
from consideration. In addition, a combination of the facts that the
normalized error is low and increasing and that the pole frequency is
increasing indicates that the voiced region in the word is more than
likely composed of a back-type voiced sound followed by a middle-type
voiced sound. Since the voiced section of the word “eight” is a front
vowel sound, the odds are quite high that the word is not “eight.”
Additional evidence that the word is probably not “eight” can be
Fig. 9—Complete set of measurements for one example of the word “four.”


obtained from the lack of a burst at the end of the spoken word. By 
pooling knowledge from the individual measurements with information 
about the structure of the words in the lexicon, the weak frication in 
the spoken “four” can be recognized. Thus, the major feature of 
parallel decisions is the ability to arrive at a correct decision even if 
one or more of the parallel inputs is in error. In the next section, we 
discuss the organization of the digit recognizer and the specific nature 
of the logic rules. 


III. DIGIT RECOGNIZER


As seen from Fig. 1, the first step in the recognition scheme is the 
important problem of endpoint alignment (determining the location 
of the spoken word during the recording interval). The algorithm used 
in this scheme has been described by Rabiner and Sambur" and has 
been shown to give reasonably good results over a wide variety of 
speakers and background levels. However, the algorithm sometimes 
has trouble finding the end of the word when the speaker sighs or
puffs after reciting the word. To compensate for this problem, the 
decision algorithm does not place too much dependence on the end 
region of the word. The “end region” is defined as the region from the
end of the word to the point at which the energy first exceeds 10 percent
of the maximum energy. Similarly, an “initial region” is defined
from the beginning of the word to the point at which the energy first
exceeds 10 percent of the maximum. The remaining section is termed
the “middle region.” The process of determining the three regions of
the word is labeled “interval segmentation” in Fig. 1.
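
The interval segmentation just described can be sketched as follows, assuming the energy has already been measured once per frame between the detected endpoints. The function name `segment_regions` and the use of Python slices to describe the regions are illustrative choices, not part of the original.

```python
import numpy as np


def segment_regions(energy, fraction=0.10):
    """Split a word into initial, middle, and end regions by frame index."""
    e = np.asarray(energy, dtype=float)
    above = np.flatnonzero(e > fraction * e.max())
    if above.size == 0:
        raise ValueError("no frame exceeds the energy threshold")

    # First frame, scanning forward, that exceeds 10 percent of the maximum.
    first = int(above[0])
    # First frame, scanning backward from the end, that exceeds it.
    last = int(above[-1])

    initial = slice(0, first)
    middle = slice(first, last + 1)
    end = slice(last + 1, len(e))
    return initial, middle, end
```

Either outer region may be empty, for example when the word begins directly with a strong vowel.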

Throughout the duration of the detected word, the four robust 
parameters discussed in Section 2.2 are measured once every 10 ms 
(i.e., every 100 points for a 10-kHz sampling rate) and smoothed using 
a nonlinear smoothing algorithm proposed by Tukey." In addition, 
the first two formant frequencies are computed using a 12-pole LPC
analysis at three points during the middle region. These include the 
point of maximum energy, the beginning of the middle region, and the 
end of the middle region. Since formant frequencies are quite speaker- 
dependent, they were used in the decision process only as a supporting 
measurement to discriminate between sounds that were quite dissimilar 
when viewed in the F1-F2 plane (/i/ and /a/ are examples). The
supporting nature of the formant measurement is also necessitated by 
the fact that the extraction of formants is not a simple and unam- 
biguous task, and too great a reliance on these parameters is fraught 
with danger. 
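
The text does not specify which of Tukey's nonlinear smoothers was applied to the parameter tracks. As a hedged sketch, a running median, which is the basic building block of such smoothers, might look like this; the function name `running_median`, the window width, and the edge handling are assumptions.

```python
import numpy as np


def running_median(values, width=3):
    """Running median with the edge values repeated so the length is kept."""
    if width < 1 or width % 2 == 0:
        raise ValueError("width must be a positive odd integer")
    v = np.asarray(values, dtype=float)
    half = width // 2
    padded = np.pad(v, half, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, width)
    return np.median(windows, axis=1)
```

A median of this kind removes isolated outliers in a track, such as a single spurious pole-frequency value, while leaving sharp sustained transitions in place, which a linear low-pass filter would blur.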

Following the measurement phase, a preliminary class decision is 
made for the utterance. An expanded view of the preliminary decision 
box is given in Fig. 10. The decision algorithm is in the form of a tree 
structure that traces down the most probable branch to arrive at the 
decided digit. However, it should be noted that there are provisions in 
the algorithm for back-tracking if some measurement strongly suggests 
that an error has been made. 

The first branch in the tree is to decide whether or not the initial 
portion is nasal-like. As we discussed previously, this decision is based 
upon the fact that nasal-like sounds have relatively low ZCR, low
normalized error, and low pole frequency. If a nasal-like beginning is 
detected, the preliminary choice is between 1 and 9. As a further check 
on this preliminary 1, 9 decision, the ending region is checked for 
nasal-like characteristics. If there is no evidence of initial nasal-like 
sounds, the digits 1 and 9 are removed from further consideration. 
When the initial region is deemed nasal-like, a relatively simple decision
can be used to decide between 1 and 9. The digit 9 can be distinguished

Fig. 10—Preliminary decision tree for digit classification algorithm.

by a sharp discontinuity in pole frequency (between the initial
nasal and the vowel) and high normalized error during the transition
from the nasal /n/ to the diphthong /aI/ (see, for example, Fig. 5). On
the other hand, as shown in Fig. 11, there is no discontinuity at the 
end of the nasal-like section in 1. 

Tracing along the decision tree, the next choice is to decide whether 
there is any definite indication of initial frication as shown by significantly
high ZCR, normalized error, and pole frequency. The positive
detection of frication eliminates the choice of the word “eight.” If
there is no definite frication, the end region is checked for a burst and 
the middle region is checked for front vowel-like characteristics. The 
formant parameters are also used to check if the middle region is com- 
posed of only front vowels. A suitable combination of the results of 
these tests is used to reject or accept the word “eight” as the digit. It
should be noted that the decision process is in the form of a hypothesis 
test. In other words, we assume that the spoken word is “eight” and 
check to see if the acoustic parameters are consistent with this hypoth- 
esis. In fact, the basic structure of the entire digit recognizer is to first 
hypothesize and then test the acoustic consequences of this hypothesis. 
The parallel processing aspect of the decision assigns the appropriate 
weight to a particular test. For example, the detection of a burst at 
this point in the decision tree is an almost 100-percent indication that 
the word is “eight.” However, the lack of a burst does not necessarily
preclude the possibility of “eight,” and this result should be weighed
accordingly. 

Assuming that we reject the spoken word as the digit 8, the remain- 
ing possibilities are 0, 2, 3, 4, 5, 6, and 7. The end region is then checked 
for fricative-like behavior. If frication is indicated, a hypothesis test 
on the digit 6 is made. The middle region is checked for front vowel 
characteristics, and the only timing measurement in the entire digit 
recognition program is performed. This measurement compares the 
relative duration of the initial frication plus ending frication to the 
length of the middle region. The frication duration is defined from the 
beginning (or ending) of the word until the point at which the zcr 
remains within one standard deviation of the average for three time 
frames. This definition can be modified when any abrupt discontinu- 
ities in normalized error or two-pole frequency indicate a more prob- 
able location for frication. In addition, the extent of frication is not 
allowed to go beyond the 10-percent maximum energy points that form 
the boundaries of the middle region. For the digit 6, the timing ratio 
should be less than one and the middle region should be less than 250 
ms. A combination of the results of the hypothesis test is used to 
verify that the spoken word was “six.” 

Fig. 11—Complete set of measurements for one example of the word “one.” 


If no frication is indicated during the ending region, a test for nasal- 
like behavior is made. If this test is positive, then a hypothesis test of 
the digit 7 is made. From Fig. 12, we can see that zcr, two-pole fre- 
quency, and energy dip sharply during the consonant /v/. The hy- 
pothesis test consists of verifying these dips and checking the vowel 
characteristics on either side of the dip. Again, the combined output of 
the test determines whether to reject or accept 7. 

If the digits 6 and 7 are eliminated from consideration, the middle 
region is analyzed to ascertain its structure. The preliminary analysis 
is achieved by noting the relative change in normalized error. If the 
normalized error increases, then the structure is characterized as an 
initial back vowel to a middle or front vowel. Thus, for increasing 
normalized error, the digits 3, 4, and 5 are considered the most likely. 
The relative change in the three second-formant (F2) measurements is 
used as supporting evidence to confirm the structure. For the digits 
3, 4, and 5, F2 should be increasing. The final decision among the 
possibilities 3, 4, and 5 is easily achieved on the basis of the robust 
parameters and formant measurements. 

Fig. 12—Complete set of measurements for one example of the word “seven.” 

If the normalized error decreases during the middle region, the digits 
0 and 2 are then the most probable choices. A decreasing F2 helps 
support these choices. To decide between 0 and 2, a dip detector pro- 
gram is used to discover the presence of the sonorant /r/ as depicted 
by a slight dip in pole frequency, normalized error, and energy. Figure 
13 shows the typical dip behavior for these parameters during the 
spoken 0. The presence (or absence) of a dip is checked with other mea- 
surements to verify the final decision. 
Fig. 13—Complete set of measurements for one example of the word “zero.” 


IV. EXPERIMENTAL RESULTS 


The experimental test of the digit recognizer was conducted in two 
parts. The first part consisted of 10 speakers (five women and five men) 
who each made 10 complete recordings of the 10 digits. The recording 
sessions were spaced over a five-week period to include the effects of 
time variation in the testing. The recordings were made in a quiet 
room with a high-quality microphone. The decision algorithm was not 
designed for the characteristics of each particular speaker, so as to give 
a true test of the speaker-independent nature of the scheme. The results 
of this experiment are shown in Table II. The average error rate is 2.7 
percent. 

A confusion matrix for each of the 100 tests of each digit is presented 
in Table III. The confusion matrix indicates that all occurrences of 
initial frication were correctly detected by the decision algorithm. In 
only 6 out of 200 examples of the digits 1 and 9 was the initial 
nasal-like consonant incorrectly determined. The confusion matrix also 
shows that most errors were made in the final detailed decision. More 
sophisticated processing would probably enhance the final decision 
and thereby make the system performance compatible with adaptive 
schemes. 


Table II — Error scores for first digit recognition experiment 


              Correct    Wrong    Percent Correct 

Women 
  SK             97        3          97 
  KD             96        4          96 
  CMcG           96        4          96 
  BMcD           97        3          97 
  SP             97        3          97 

  Total                               96.6 

Men 
  Ms*            89        1          98.8 
  LR            100        0         100 
  RS             97        3          97 
  AR*            88        2          97.7 
  JH             97        3          97 

  Total                               98.1 

Sum             954       26          97.3 


* Missed one recording session. 


Table III — Confusion matrix for first digit recognition experiment 


                                     Word Spoken 

                       0    1    2    3    4    5    6    7    8    9 

                  0   93    0    3    0    0    1    0    0    0    1 
                  1    0   96    0    0    0    0    0    0    0    0 
                  2    2    0   90    1    0    0    0    1    0    1 
                  3    2    0    2   97    0    0    1    0    0    ? 
Word Recognized   4    0    1    2    0   97    1    0    0    0    ? 
                  5    0    0    0    0    0   96    0    0    0    1 
                  6    0    0    0    0    0    0   97    0    0    0 
                  7    1    0    0    0    1    0    0   97    0    0 
                  8    0    0    1    0    0    0    0    0   98    0 
                  9    0    1    0    0    0    0    0    0    0   93 

Individual Errors      5    2    8    1    1    2    1    1    0    5 


Total—26 errors out of 980 utterances 
Table IV — Distribution of errors for the second digit 
recognition experiment 


                Number of Digits Incorrect      Total 
                   0        1        2          Percent Correct 
Female (30)       11       15        4              92.3 
Male (25)         17        8        0              96.8 
Total (55)        28       23        4              94.4 


To ensure the validity of these experimental results, another more 
challenging test was conducted. In this experiment, 55 speakers (30 
women and 25 men) were selected at random and asked to give one 
rendition of the 10 digits. Instead of using a high-quality microphone 
in a quiet environment, the input speech was taken from a close-talking 
microphone alongside a chattering Teletypewriter. The decision was 
performed on-line, and the speaker was only instructed when to say 
each digit. The results for this experiment show an average error rate 
of 5.6 percent. In addition, no speaker tested did worse than 2 out of 
10 wrong. The distribution of errors for this experiment is shown in 
Table IV. The distribution matrix indicates the generally good per- 
formance of the system. 
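
The percent-correct column of Table IV follows from the speaker counts alone. The short Python sketch below is an illustration rather than part of the original experiment: it assumes each speaker contributed exactly 10 digits, as the text states, and the function name is hypothetical.

```python
def percent_correct(speakers_by_errors, digits_per_speaker=10):
    """Percent of digits recognized correctly.

    speakers_by_errors maps "number of digits wrong" to the number of
    speakers who made that many errors (one row of Table IV).
    """
    speakers = sum(speakers_by_errors.values())
    errors = sum(wrong * count for wrong, count in speakers_by_errors.items())
    total_digits = speakers * digits_per_speaker
    return 100.0 * (total_digits - errors) / total_digits


# Speaker counts as given in Table IV (0, 1, or 2 digits incorrect).
female = {0: 11, 1: 15, 2: 4}
male = {0: 17, 1: 8, 2: 0}
combined = {k: female[k] + male[k] for k in female}

for name, row in (("Female", female), ("Male", male), ("Total", combined)):
    print(f"{name}: {percent_correct(row):.1f} percent correct")
```

The combined figure corresponds to the 5.6-percent average error rate quoted in the text.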

It should be noted that there was no effort in our experimentation 
to select speakers with good diction. The speakers represent dialects 
from most of the regions in the U.S. In informal on-line demonstrations 
of the system, many non-American speakers (French, Japanese, 
Indian, German) tried having their English-pronounced digits recog- 
nized. The informal results were in good agreement with the other 
experiments. In addition, an informal attempt to “beat” the system 
by holding one’s nose or using falsetto also proved generally 
unsuccessful. 


V. DISCUSSION 


The digit-recognition system that has been described in this paper 
can be considered as a first pass in the direction of speaker-independent 
speech recognition. Our approach has been to describe a variety of 
speech sounds in terms of a set of robust measurements. We then 
devised a tree-structured decision algorithm that used these measure- 
ments to characterize the acoustic features of the presented word. The 
sequence of branches in the tree was designed to resolve the most 
obvious sounds and then proceed to the more difficult decisions. Thus, 
the relatively easy problem of distinguishing between noise-like sounds 
and nasal-like sounds was attacked first, and the vowel-like 
constituents were then determined. The output of the preliminary 
decision tree was a small subset of the 10 possible digits that 
almost invariably included the spoken word. Most of the errors in the 
system were made in the box labeled “final decision” in Fig. 1. 

The preliminary decision tree (Fig. 10) incorporated some original 
ideas about self-normalization that effectively eliminated the need for 
tuning the system to the characteristics of a given speaker. Such a 
decision tree can be extended to prune down a much larger lexicon and 
arrive at a small list of possible choices. Improvements in the method 
of selection within the list of possibilities could lead to speaker-inde- 
pendent systems that can truly compete with the performance of 
adaptive schemes. Such improvements could result by incorporating 
more sophisticated probabilistic methods into the framework of the 
“hypothesize-verify” technique proposed in this paper. 

Our goal in the development of the digit-recognition system is to 
show that speaker-independent digit recognition is possible through an 
intelligent description of broad categories of speech sounds. This 
description uses what is known about the necessary characteristics 
of each category instead of blindly using pattern-matching algorithms 
to rigidly quantify the sounds. The latter approach is doomed to failure 
for a large enough speaker population because it overlooks the fact 
that the information in the patterns contains as much personal 
information as linguistic information. 
Vertically separated antennas are recommended to increase the 
transmission availability of line-of-sight microwave links by reducing the 
duration and frequency of multipath fading events. The emphasis is on 
application to the 4- and 6-GHz bands on links with negligible ground reflections. 
Necessary signal processing, links utilizing passive repeaters (reflectors), 
and overwater links are also treated. Some new experimental data are 
presented. 


I. INTRODUCTION 


Bell System microwave radio relay routes consist of links (paths, 
hops) that have an average length of about 26 miles. Transmission 
on most hops is along the line of sight, with antennas mounted on 
towers that are typically 250 feet high. In some cases, when line-of- 
sight transmission between towers is not practical, large reflectors 
on prominent points of terrain (passive repeaters) are used. Tower 
locations are selected to avoid ground reflections, but inevitable excep- 
tions, such as transmission across a lake, do occur. 

Multipath propagation during anomalous atmospheric conditions 
can give rise to destructive interferences at the receiving antenna; the 
resultant signal fluctuates (fades) and may be reduced to practically 
zero for seconds at a time. The corresponding interruptions to service, 
if permitted to occur, would be unacceptable. Interruptions to opera- 
tion can be avoided by switching from an unserviceable radio channel 
to a protection channel operating at a different radio frequency, since 
multipath fading, being an interference phenomenon, is frequency 
selective. Use of such protection (frequency diversity) has been re- 
stricted to conserve the frequency spectrum.' 

Space diversity is an alternative or additional form of protection 
from the effects of multipath fading.?-* Its effectiveness depends upon 
the fact that multipath propagation results in vertical structure of the 
electromagnetic fields at the receiving tower. Selection between two 
vertically separated antennas receiving at the same frequency is 
comparable in effect to a frequency-diversity system with a protection 
channel for every working channel (switched on a per-hop basis). 
Unfamiliarity with the principles of operation and unavailability of 
space-diversity equipment, as well as costs, have inhibited use of 
space diversity in the past. Currently, use of space diversity is in- 
creasing. One reason for this (apart from spectrum conservation) is 
that, in areas of high fading, frequency diversity alone cannot provide 
the desired transmission availability. 

Space-diversity engineering, as presented here, encompasses estima- 
tion of fading, determination from transmission availability objectives 
of the need for protection, and calculation of the antenna separation 
needed to obtain the required transmission availability. The effects of 
signal processing on the improvement available from an antenna pair 
are discussed, with particular attention given to threshold switching. 
Space diversity through passive repeaters and on over-water paths is 
also discussed. Clearance requirements are summarized in the last 
section; current work on antennas placed so low as to lack clearance 
under normal propagation conditions is discussed in Appendix A. 

The emphasis in this paper is on application in the 4- and 6-GHz 
frequency bands, although the expressions for the estimation of the 
amount of fading and the diversity improvement apply to other 
microwave frequencies. However, transmission in the 11-GHz band is 
also affected by rain, and this must be taken into account by additional 
reduction of the effects of multipath fading. This design aspect belongs 
more properly in a treatment of 11-GHz radio system design and is 
not discussed here. 


II. DESCRIPTION OF FADING 


The RF power received after transmission over a microwave radio 
hop is never absolutely constant; even at noon, when the atmosphere 
has “stabilized,” there can be fractional dB excursions (scintillations 
recurring a few times per second), as well as slower excursions of a dB 
or two. In propagation experiments, the normal value of the received 
signal is determined from the peak in a signal-level histogram obtained 
over at least one-half hour at or near noon. This so-called free-space 
value of the received signal is determined repeatedly at least once a 
week to identify periods during which enhanced or depressed signals 
have resulted from the relatively steady atmospheric focusing or 
defocusing. These periods, therefore, are not “normal.” 

During fading, the received RF power can be practically zero for 
seconds at a time. The terminology to describe this is introduced in 
Fig. 1 through an example in which the free-space value is -30 dBm 
and a single, idealized fade decreases the received power temporarily 
to -80 dBm; levels in dB relative to normal are denoted by 20 log L. 
The time during which a signal is below a level is called the duration of 
fade of that level (the duration of a 40-dB fade is illustrated in Fig. 1). 
Average durations of fades are independent of microwave frequency* 
and are proportional to L; typical numerical values are given by (see 
also Fig. 2)*"!! 

$$ \langle t \rangle = 410\,L \ \text{seconds}, \qquad L < 0.1. \tag{1} $$

As an example, the average duration of a 40-dB fade ($L = 10^{-2}$) is 4.1 
seconds, both at 4 and 6 GHz. 

Fig. 1—Definitions of L and fade duration (-30 dBm assumed normal as an 
example). 

Fig. 2—Average fade durations ($\langle t \rangle = 410\,L$, $L < 0.1$). 


* An underlying assumption throughout this work is that antenna sizes and path 
lengths are as encountered most often in practice: antenna diameters are between 
about 4 and 16 feet, and path lengths are between about 14 and 40 miles. 
The sum of the durations of all fades of a particular depth is called 
“time below level.” It is proportional to $L^2$, since the number of fades 
is proportional to $L$, and its numerical values are given by® 

$$ T = r\,T_0 L^2, \qquad L < 0.1, \tag{2} $$

where $T_0$ is the time period over which the summation of fade durations 
is made (a month, for example); the units of $T$ are those of $T_0$ (seconds 
are normally used). The fade occurrence factor $r$ for heavy fading 
months (see Fig. 3) is® 

$$ r = c\,(f/4)\,D^3 \times 10^{-5}, \tag{3} $$

where 

$$ c = \begin{cases} 4, & \text{over water and Gulf coast}, \\ 1, & \text{average terrain and climate}, \\ 1/4, & \text{mountains and dry climate}, \end{cases} \tag{4} $$

f = frequency in GHz, 

and 

D = path length in miles. 
Fig. 3—Multipath occurrence factor. 

Fig. 4—Time below level in a heavy fading month ($D = 26$ miles, $c = 1$, $T_0 = 31$ 
days $= 2.68 \times 10^6$ seconds). 


As an example, values of $T$ as a function of fade depth for a 26-mile 
path (average length) and average terrain and climate are shown in 
Fig. 4 for a heavy fading month. The lines have the decade of time per 
10-dB slope typical of multipath fading, specified by the $L^2$ functional 
dependence. The values of $T$ at -40 dB are 47 and 71 seconds at 4 and 
6 GHz, respectively. Based on an average duration of 4.1 seconds, this 
corresponds to 11 fades of 40 dB at 4 GHz. 
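
The worked example can be checked numerically. The Python sketch below is illustrative only: it reads Eq. (3) as r = c (f/4) D^3 x 10^-5 and Eq. (2) as T = r T0 L^2, the readings that reproduce the 47- and 71-second values quoted above, and all function names are hypothetical.

```python
def level_from_fade_depth(fade_db):
    """Level parameter L for a fade depth in dB (20 log L = -fade_db)."""
    return 10.0 ** (-fade_db / 20.0)


def fade_occurrence_factor(c, f_ghz, d_miles):
    """Eq. (3), as read here: r = c (f/4) D^3 x 10^-5."""
    return c * (f_ghz / 4.0) * d_miles ** 3 * 1e-5


def time_below_level(c, f_ghz, d_miles, fade_db, t0_seconds):
    """Eq. (2), as read here: T = r T0 L^2, stated only for L < 0.1."""
    level = level_from_fade_depth(fade_db)
    if level >= 0.1:
        raise ValueError("Eq. (2) is stated only for L < 0.1 (fades deeper than 20 dB)")
    return fade_occurrence_factor(c, f_ghz, d_miles) * t0_seconds * level ** 2


def average_fade_duration(fade_db):
    """Eq. (1): average duration of a fade, 410 L seconds (L < 0.1)."""
    return 410.0 * level_from_fade_depth(fade_db)


T0_HEAVY_MONTH = 31 * 24 * 3600  # 31 days in seconds, as in Fig. 4

for f_ghz in (4.0, 6.0):
    t = time_below_level(1.0, f_ghz, 26.0, 40.0, T0_HEAVY_MONTH)
    fades = t / average_fade_duration(40.0)
    print(f"{f_ghz:.0f} GHz: T = {t:.0f} s below -40 dB, about {fades:.1f} fades")
```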

The coefficient c in (4) incorporates the effects of both terrain and 
humidity and is adequate for first estimates of expected fading in many 
cases. Differentiation between paths of identical climate but differing 
terrain can be done by introducing a terrain roughness parameter” 
quantifying common knowledge that paths over rough terrain fade 
less than paths over smooth terrain, presumably because stable atmo- 
spheric layering is less likely to occur over rough terrain. Terrain rough- 
ness is calculated from terrain heights above a reference level (sea 
level, for example) obtained from the path profile at one-mile intervals, 
with the ends of the path excluded. The standard deviation of the 
resulting set of numbers is the terrain roughness, denoted by w (see 
sample calculation in Appendix B). Applicable values of w range from 
20 feet (“smooth”) to 140 feet (“rough”); values of 20 and 140 should 
be used when calculated values of w are less than 20 or larger than 140. 
Modified for roughness, the equations for c become: 

$$ c = \begin{cases} 2\,(w/50)^{-1.3}, & \text{coastal areas}, \\ (w/50)^{-1.3}, & \text{average climate}, \\ 0.5\,(w/50)^{-1.3}, & \text{dry climate}, \end{cases} \tag{5} $$

where a roughness of 50 feet has been defined as “normal.” 
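
The roughness procedure can be sketched in a few lines of Python. This is a hedged illustration, not the calculation of Appendix B: it assumes the population form of the standard deviation, reads the exponent in Eq. (5) as -1.3 for all three climates, and uses a made-up path profile; the function and dictionary names are hypothetical.

```python
import statistics

# Leading coefficients of Eq. (5), keyed by climate (keys are illustrative).
CLIMATE_COEFFICIENT = {"coastal": 2.0, "average": 1.0, "dry": 0.5}


def terrain_roughness(heights_ft):
    """Terrain roughness w in feet.

    heights_ft: terrain heights above a reference level, taken at one-mile
    intervals with the path ends already excluded. The population standard
    deviation is an assumption; the result is limited to 20..140 feet.
    """
    w = statistics.pstdev(heights_ft)
    return min(max(w, 20.0), 140.0)


def roughness_coefficient(w_ft, climate):
    """Eq. (5), as read here: c = k (w/50)^-1.3 with k set by climate."""
    return CLIMATE_COEFFICIENT[climate] * (w_ft / 50.0) ** -1.3


# Hypothetical profile heights in feet, for illustration only.
profile = [310.0, 295.0, 340.0, 360.0, 330.0, 300.0, 285.0]
w = terrain_roughness(profile)
print(f"w = {w:.1f} ft, c (average climate) = {roughness_coefficient(w, 'average'):.2f}")
```

Smoother paths (smaller w) give a larger c and therefore more predicted fading, which matches the qualitative statement in the text.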


III. PERFORMANCE OBJECTIVES 


The time below level of the received signal, at a fade depth equal to 
the receiver fade margin, represents potential service-failure time. Pro- 
tection is needed if this exceeds the value set by transmission-avail- 
ability objectives. 

Bell System short-haul objectives limit service failure time to 0.02 
percent (two-way) annually on a 250-mile route due to all causes. One- 
half of this is allocated to causes associated with equipment, main- 
tenance, and plant errors. There are obvious exceptions to this; for 
example, unavailability on a route where all hops are exceptionally 
short will be due mainly to equipment outages, and may be so allocated. 

The allocation to fading, therefore, is 0.01 percent (two-way) 
annually. In the past, this allocation had been apportioned between 
multipath and obstruction fading (“earth-bulge” fading). However, 
the occurrence of intolerable obstruction fading can be decreased by 
increasing clearance (higher towers or shorter hops) or by increasing 
system gain margin; reliability records show that there has been a 
gradual decrease in the occurrence of obstruction fading over the 
years.® In the construction of new hops or in the upgrading of older 
ones in locations where obstruction fading is known to occur (or by 
related experience is expected to occur), increased clearances (or limits 
on path lengths) are assumed to be used. Consequently, no allocation 
to obstruction fading is made in space-diversity engineering ; the entire 
0.01 percent two-way annual fading allocation is applied to multipath 
fading. The one-way annual multipath fading allocation for 250 miles 
becomes 0.005 percent or approximately 26 minutes per year (1600 
seconds per year). The corresponding allocation to a hop D miles long 
is 1600 × D/250 seconds per year (165 seconds per year for the average 
26-mile hop). 

Estimated annual time below level (for comparison with the objective) 
is obtained from the equation for $T$ in the previous section, with 
$T_0$ describing the length of the fading season. As a geographic average, 
the value of $T_0$ in this estimate is equal to the number of seconds in 
three months; this assumes that all significant fading is contained in 
two heavy and two medium fading months, which is equivalent to 
three heavy fading months.“ Estimation of the length of the fading 
season as a function of geographic location is discussed in Appendix C. 

Fig. 5—Annual time below level (average case). Objectives: (a) short haul = 165 
seconds/year, (b) long haul = 20 seconds/year. 
Continuation of the example of Fig. 4 provides the annual 
time-below-level curves in Fig. 5 (142 and 212 seconds below -40 dB at 4 
and 6 GHz, respectively). For a fade margin of 35 dB, considered here 
for the purposes of discussion, the time below level is too large, com- 
pared to the 165-second objective, by factors of 2.7 and 4 at 4 and 6 
GHz, respectively; protection against multipath fading (space or 
frequency diversity) is needed when the fade margin is 35 dB. At a 
fade margin of 40 dB, the 4-GHz channel in the example can get by 
without protection. 
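
The comparison with the short-haul objective can be reproduced with a small Python sketch. It is illustrative only: the fading season is assumed to be three 31-day months, the time-below-level expression is read as T = c (f/4) D^3 x 10^-5 T0 L^2, the objective is the rounded 165 seconds per year used in the text, and the function names are hypothetical.

```python
FADING_SEASON_S = 3 * 31 * 24 * 3600   # assumed: three 31-day heavy fading months
SHORT_HAUL_OBJECTIVE_S = 165.0         # seconds per year for the 26-mile hop


def annual_time_below_level(f_ghz, fade_margin_db, c=1.0, d_miles=26.0):
    """Annual time below the fade margin, T = r T0 L^2 with L^2 = 10^(-dB/10)."""
    r = c * (f_ghz / 4.0) * d_miles ** 3 * 1e-5
    return r * FADING_SEASON_S * 10.0 ** (-fade_margin_db / 10.0)


def needs_protection(f_ghz, fade_margin_db, objective_s=SHORT_HAUL_OBJECTIVE_S):
    """True when the unprotected channel exceeds the annual objective."""
    return annual_time_below_level(f_ghz, fade_margin_db) > objective_s


for f_ghz in (4.0, 6.0):
    for margin_db in (35.0, 40.0):
        t = annual_time_below_level(f_ghz, margin_db)
        print(f"{f_ghz:.0f} GHz, {margin_db:.0f}-dB margin: {t:.0f} s/year, "
              f"{t / SHORT_HAUL_OBJECTIVE_S:.1f} x objective, "
              f"protection needed: {needs_protection(f_ghz, margin_db)}")
```

Under these assumptions the 6-GHz channel still exceeds the objective at a 40-dB fade margin, while the 4-GHz channel does not.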

For long-haul radio the overall objective is also 0.02 percent, but the 
route length is 4000 miles. However, long-haul transmission models 
assume that half of the hops never experience significant fading.44 With 
the addition of this assumption to those previously made in this sec- 
tion, the long-haul multipath allocation to a hop D miles long becomes 
1600 × D/2000 seconds per year (20 seconds per year for the average 
26-mile hop). In practice, long-haul radio has at least one protection 
channel, which is required for equipment protection and maintenance 
and is used to provide frequency-diversity protection. Such use modi- 
fies the need for space-diversity protection (see Appendix D). Use of 
frequency diversity in short-haul radio can be expected to be infrequent 
because of regulatory restrictions.} 


IV. THE SPACE-DIVERSITY EFFECT 

During periods of multipath fading, deep fades of signals received 
on two vertically separated receiving antennas rarely overlap in time. 
The relatively few that do overlap give rise to simultaneous time below 
level (sum of durations of simultaneous fades, see Fig. 6), which is 
proportional to $L^4$ and can be expressed as7+315 

$$ T_s = T/I_0, \tag{6} $$

where $T$ is the time below level of the signal received on the main 
antenna and $I_0$ is the available improvement, given numerically in 
practical units by the following (see also nomogram in Fig. 7):7,8 

$$ I_0 = 7 \times 10^{-5}\, v^2 s^2 f / (D L^2), \qquad s \le 50, \tag{7} $$

where 

v = relative gain parameter (gain of secondary antenna relative 
to main antenna in dB is 20 log v), 

s = vertical separation of receiving antennas in feet, center-to- 
center, 

f = frequency in GHz, 

D = path length in miles, 

and 

L = level parameter (level in dB relative to normal is 20 log L). 
Fig. 6—Definition of simultaneous fade (axis labels: envelope voltage; 20 log L, dB). 
Equation (7) applies only for the ranges of variables indicated on the 
nomogram in Fig. 7. Extrapolation of the scales may lead to errors. 
For example, under some conditions the increase in improvement due 
to an increase in separation over 50 feet may be small. Separations for 
which the available improvement is less than 10 should not be used; 
if possible, separations of at least 30 feet should be used. 

The example of Fig. 5 is continued in Fig. 8 to demonstrate drawing 
of a curve for $T_s$; the secondary antenna in the example has the same 
gain as the main antenna ($v^2 = 1$) and the vertical center-to-center 
separation of the antennas is 30 feet ($s = 30$). The values of $I_0$ at -40 
dB are 145 and 97 for the 6- and 4-GHz channels, respectively. The 
values of $T_s$ at -40 dB are 212/145 = 1.46 seconds and 142/97 = 1.46 
seconds, identical for 6 and 4 GHz. This comes about because both $T$ 
and $I_0$ are proportional to frequency. Having established one point 
(1.46 seconds at -40 dB) for $T_s$, we draw through it a line with a slope 
of a decade of time per 5 dB, as specified by the $L^4$ functional 
dependence. When $I_0$ becomes less than about 5, terms in addition to that 
proportional to $L^4$ are needed to describe $T_s$, which is why the line for 
$T_s$ in Fig. 8 is not extended all the way to the left into the region 
(normally not of practical interest) where $T_s$ approaches $T$. 

The simultaneous time below level in Fig. 8 is smaller than the 165- 
seconds-per-year short-haul objective for fade margins larger than 
about 30 dB; it is smaller than the 20-seconds-per-year long-haul 
objective for fade margins larger than about 34 dB. 
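
The numbers in this example can be recomputed with the Python sketch below. It is an illustration under stated assumptions: Eq. (7) is read as I0 = 7 x 10^-5 v^2 s^2 f / (D L^2), the annual time below level uses a fading season of three 31-day months with c = 1, and the function names are hypothetical.

```python
import math

FADING_SEASON_S = 3 * 31 * 24 * 3600  # assumed: three 31-day heavy fading months


def annual_time_below_level(f_ghz, fade_db, c=1.0, d_miles=26.0):
    """Single-antenna annual time below level, T = r T0 L^2."""
    r = c * (f_ghz / 4.0) * d_miles ** 3 * 1e-5
    return r * FADING_SEASON_S * 10.0 ** (-fade_db / 10.0)


def available_improvement(f_ghz, fade_db, s_ft=30.0, v2=1.0, d_miles=26.0):
    """Eq. (7), as read here; stated only for separations up to 50 feet."""
    if s_ft > 50.0:
        raise ValueError("Eq. (7) is stated only for s <= 50 feet")
    level_squared = 10.0 ** (-fade_db / 10.0)
    return 7e-5 * v2 * s_ft ** 2 * f_ghz / (d_miles * level_squared)


def simultaneous_time_below_level(f_ghz, fade_db, **kw):
    """Eq. (6): T_s = T / I0."""
    return annual_time_below_level(f_ghz, fade_db) / available_improvement(f_ghz, fade_db, **kw)


def fade_margin_meeting(objective_s, f_ghz=4.0):
    """Fade margin (dB) at which T_s equals the objective.

    T_s varies as L^4, a decade of time per 5 dB, so the -40 dB point
    is simply scaled along that slope.
    """
    ts_40 = simultaneous_time_below_level(f_ghz, 40.0)
    return 40.0 + 5.0 * math.log10(ts_40 / objective_s)


print(f"I0 at -40 dB: {available_improvement(4.0, 40.0):.0f} (4 GHz), "
      f"{available_improvement(6.0, 40.0):.0f} (6 GHz)")
print(f"T_s at -40 dB: {simultaneous_time_below_level(4.0, 40.0):.2f} s")
print(f"Margin for 165 s/year: {fade_margin_meeting(165.0):.1f} dB")
print(f"Margin for 20 s/year: {fade_margin_meeting(20.0):.1f} dB")
```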


V. COMPARISON OF SPACE AND FREQUENCY DIVERSITY 


Space diversity in its most common form provides a protection 
channel for every working channel (1 × 1 protection) on a per-hop 
basis. Frequency diversity usually provides one or two protection 
channels for m working channels (1 × m or 2 × m protection) on the 
basis of switching sections that can contain as many as 10 hops in 
extreme cases. The most effective form of frequency diversity is, of 
course, 1 × 1 on a per-hop basis (now restricted in use at 4 and 6 GHz 
because of spectrum conservation); this can readily be compared to 
space diversity. 

For equal performance the available improvements, $I_0$, are equated. 
A convenient form for $I_0$ is 

$$ I_0 = v^2 q L^{-2}, \tag{8} $$

where for space diversity (from the previous section) 

$$ q = 7 \times 10^{-5}\, s^2 f / D, \qquad s \le 50, \tag{9} $$

and for frequency diversity?® 

$$ q = 50\,(\Delta f / f) / (f D), \qquad \Delta f < 0.5 \ \text{GHz}, \tag{10} $$

where $f$ is the frequency in GHz (4 or 6), and $\Delta f$ is the difference of 
radio channel center frequencies, also in GHz; $D$ is the path length in 
miles. Values of separations in space and frequency providing equal 
performance (for antennas of equal size; $v^2 = 1$) are obtained by 
eliminating $q$ from (9) and (10): 

$$ s = \begin{cases} 106 \sqrt{\Delta f}, & \text{in the 4-GHz band}, \\ 57.5 \sqrt{\Delta f}, & \text{in the 6-GHz band}, \end{cases} \tag{11} $$

where $s$ is in feet. A 30-foot separation is equivalent to a $\Delta f$ of about 
0.08 GHz in the 4-GHz band and about 0.27 GHz in the 6-GHz band 
(Fig. 9). 

Fig. 8—Annual simultaneous time below level for 30-foot separation of antennas 
(continuation of average-case example from Fig. 5). 

Fig. 9—Separations in space and frequency providing equal protection (one-for-one 
switching on a per-hop basis; antennas of equal size, $v^2 = 1$). 
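
The elimination step behind Eq. (11) can be verified numerically. The Python sketch below is illustrative: it equates the space-diversity expression 7 x 10^-5 s^2 f / D with the frequency-diversity expression 50 (Δf/f) / (f D), so the path length cancels; the function names are hypothetical.

```python
import math


def equivalent_separation_ft(delta_f_ghz, f_ghz):
    """Antenna separation giving the same improvement as a spacing delta_f.

    Setting 7e-5 s^2 f / D equal to 50 (delta_f / f) / (f D) and solving
    for s gives s = sqrt(50 delta_f / (7e-5 f^3)); D drops out.
    """
    return math.sqrt(50.0 * delta_f_ghz / (7e-5 * f_ghz ** 3))


def equivalent_delta_f_ghz(s_ft, f_ghz):
    """Inverse relation: frequency spacing equivalent to a separation s."""
    return 7e-5 * s_ft ** 2 * f_ghz ** 3 / 50.0


for f_ghz in (4.0, 6.0):
    print(f"{f_ghz:.0f}-GHz band: s = {equivalent_separation_ft(1.0, f_ghz):.1f} sqrt(delta_f) ft; "
          f"30 ft is equivalent to {equivalent_delta_f_ghz(30.0, f_ghz):.2f} GHz")
```

The stated validity limits (s up to 50 feet, Δf below 0.5 GHz) are not enforced in this sketch and should be checked by the caller.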


VI. SIGNAL PROCESSING 


The signals received by the two antennas must be processed to obtain 
a diversity signal. Static addition at RF is not practical, since the 
duration of one-half cycle at, say, 4 GHz is 1/8 nanosecond, and delay changes 
of this magnitude in the relative arrival times of the two signals arise 
easily because of daily angle-of-arrival variations. Without dynamic 
phase compensation, such changes would lead to signal cancellations 
during normal daytime operation, even when multipath fading is not 
present. The cost of dynamic phase compensation has so far been 
prohibitive. 

Addition at baseband without dynamic phase compensation has 
been used (duration of a half cycle at 10 MHz is 50 nanoseconds). In 
some implementations, the weaker signal is dropped from the sum, 
when the relative signal strength ratio exceeds some 4 dB, to improve 
the signal-to-noise ratio of the diversity signal. Simultaneous time 
below level can be used to approximate the time below level of the 
diversity signal. 

Switching, particularly at RF, permits an economically advantageous 
configuration of equipment. The simultaneous time-below-level curves 
(Fig. 8) describe the performance of an idealized comparator (switch), 
where the diversity signal is, at every instant, the stronger of the two 
received signals. The switching activity, accompanied by undesirable 
step phase changes, is high. This can be reduced by introducing 
hysteresis, but at the cost of reduced performance. Suppose switching 
occurs only when the ratio of the received powers is larger than a 
number $b^2$ ($10 \log b^2$ in dB). The improvement realized becomes 

$$ I = \eta I_0, \tag{12} $$

where an estimate of the efficiency factor $\eta$ (see Appendix E) is 

$$ \eta = 2/(b^2 + b^{-2}). \tag{13} $$

As an example, when $10 \log b^2$ is 6 dB, $\eta$ is 0.47, and an available 
improvement of 100 becomes a realized improvement of 47. 
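
The 6-dB example can be checked with a short Python sketch. It assumes the efficiency estimate has the form η = 2/(b^2 + b^-2), which is the reading that reproduces the quoted value of 0.47; the function names are hypothetical.

```python
def hysteresis_efficiency(hysteresis_db):
    """Efficiency factor for a comparator switch with hysteresis.

    hysteresis_db is 10 log b^2, the power ratio (in dB) that must be
    exceeded before the switch acts. Assumed form: 2 / (b^2 + b^-2).
    """
    b_squared = 10.0 ** (hysteresis_db / 10.0)
    return 2.0 / (b_squared + 1.0 / b_squared)


def realized_improvement(available_improvement, hysteresis_db):
    """Available improvement scaled by the efficiency factor."""
    return hysteresis_efficiency(hysteresis_db) * available_improvement


print(f"efficiency at 6 dB: {hysteresis_efficiency(6.0):.2f}")
print(f"realized improvement for I0 = 100: {realized_improvement(100.0, 6.0):.0f}")
```

With no hysteresis (0 dB) the factor is 1, so the idealized comparator case is recovered.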

A different approach must be used when only one receiver per radio 
channel is available, which is the case in long-haul radio. The diversity 
implementation utilizes an RF waveguide switch activated by the 
AGC voltage when the receiver input falls below a threshold (one 
example of switching logic is shown in Table I). Switching of this sort 
has been referred to as threshold or “blind” switching, since at the 
instant of switching there is no assurance that a stronger signal will 
be available at the other antenna. Most of the time, a stronger signal 
is found, since few of the deep fades on the two receiving antennas 
overlap in time. The time below level $T_t$ for threshold switching is 
calculated (see Appendix F) in Fig. 10 for a 4-GHz channel (in 
continuation of the example from Fig. 8); the threshold is at -35 dB relative 
to normal. The values of $T_t$ and $T_s$ are identical at the threshold. Above 
threshold, $T_t$ approaches $T$ rapidly, since the switch does not act for 
fades during which the minimum signal does not drop below -35 dB. 
The part of $T_t$ of importance in space-diversity engineering is the 
straight-line section below threshold. This straight-line section is 
parallel to $T$, since it represents an average of the two single-antenna 
statistics by virtue of the continued cycling of the switch due to failure 
of the receiver to locate a signal stronger than the threshold value, 
which is -35 dB relative to normal in the example. 


Table I — Switching logic of a threshold switch 

Switch Output 
Relative to     Switch Connected to    Switch Action 
Threshold 

High            Main antenna           Stay on main antenna 
High            Secondary antenna      Switch to main antenna after 15 minutes 
Low             Main antenna           Switch to secondary antenna after 0.1 second 
Low             Secondary antenna      Switch to main antenna after 0.1 second 


Fig. 10—Time below level in threshold switching—continuation of average-case 
example (4 GHz, threshold at -35 dB, 30-foot separation of antennas). 

A suitable value of the threshold is about 2 dB above the fade margin. 
The —35 dB threshold in the example would be appropriate for a fade 
margin of 37 dB. The improvement (J; = T/T;) is constant below 
threshold (about 30 in the example) and equal to the value of Jo at 
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Fig. 11—Example of reduced performance due to smaller secondary antenna (10 
log v? = — 6, other parameters as in Fig. 10). 
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the threshold. This is one reason for placing the threshold close to the 
fade margin, since a threshold at —30 dB, for example, would provide 
an I, of only about 10. A second reason is to limit the occurrence of the 
phase steps due to the cycling of the switch to occasions when the 
signals border on the unusable (actually, the chance is small that the 
signals remain for prolonged periods in the 2-dB corridor between the 
threshold and the fade margin, and many of the phase steps will occur, 
therefore, when the signals are unusable). The cycling should occur at 
a rate of at least five times per second; at much slower rates, a good 
signal is not found fast enough and performance deteriorates. Cor- 
ridors smaller than 2 dB should be avoided, since signal drifts caused 
by aging of equipment or noise accumulation from hop to hop in a 
switching section can frustrate design intentions and require frequency 
diversity action before space diversity. 
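The four rows of the switch-action table can be read as a small lookup. The following Python sketch is only an illustration of that table; the names `SwitchAction` and `threshold_switch_action` are hypothetical and do not describe the actual switch hardware.

```python
from dataclasses import dataclass

RETURN_TO_MAIN_DELAY_S = 15 * 60  # "after 15 minutes" in the switch-action table
CYCLING_DELAY_S = 0.1             # "after 0.1 second" in the switch-action table


@dataclass(frozen=True)
class SwitchAction:
    target: str     # antenna the switch should end up on: "main" or "secondary"
    delay_s: float  # seconds to wait before switching (0.0 when no switch is needed)


def threshold_switch_action(output_high: bool, connected_to: str) -> SwitchAction:
    """Return the table entry for the given switch output and antenna."""
    if connected_to not in ("main", "secondary"):
        raise ValueError("connected_to must be 'main' or 'secondary'")
    if output_high:
        if connected_to == "main":
            return SwitchAction("main", 0.0)  # stay on main antenna
        return SwitchAction("main", RETURN_TO_MAIN_DELAY_S)
    # Output below threshold: move to the other antenna after 0.1 second.
    other = "secondary" if connected_to == "main" else "main"
    return SwitchAction(other, CYCLING_DELAY_S)
```

With a 0.1-second delay per step, a switch that finds both signals below threshold changes antennas about ten times per second, which satisfies the stated minimum cycling rate of five times per second.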

Use of a smaller secondary antenna reduces performance, as illustrated
in Fig. 11, where the only change from Fig. 10 is that a secondary
antenna with 6 dB less gain ($v^2 = 0.25$) has been used. The
improvement $I_t$ has decreased to about 8 (≈ 30/4) from about 30 in the
equal-size-antenna case.


VII. SPACE-DIVERSITY TRANSMISSION


Space diversity for microwave radio is usually envisioned as diversity 
reception; i.e., the system consists of a single antenna which transmits 
to two vertically separated receiving antennas. Since the path loss 
from antenna to antenna does not depend on the direction of trans- 
mission, an arrangement of two vertically separated transmitting 
antennas and a single receiving antenna can also be used. 

Combined use of transmitting and receiving diversity can reduce 
tower work; on a long route only alternate towers would be affected. 
A single bad hop can be corrected by additional installation at one 
end only. On hops where additional installation at one end is impossible 
because of economics or zoning, use of both transmitting and receiving 
diversity at the other end provides full diversity protection in both 
directions. 

A drawback to transmitting diversity is the vulnerability of the 
control signal. In diversity reception, the signals from both antennas 
are always available at the switching site, to be processed in any de- 
sired manner. In transmitting diversity, only one antenna can transmit 
at a given frequency at a given time (dynamic phase correction for 
control of the two transmitters does not appear feasible or desirable). 
The required control information must be fed back from the receiving
to the transmitting end, since control must be based on fading in the
controlled channel because of the frequency selectivity of deep fades
(see Appendix G). The feedback link increases both the cost of
control and the possibility for failure of control.


VIII. FIELD EXPERIENCE


The basic experimental data on simultaneous fading on vertically 
separated receiving antennas were obtained in Ohio and Texas in 
1966. The variation of the improvement as the square of the vertical 
separation of the antennas was further verified experimentally in 
Georgia in 1968 (Appendix H). Field followup in terms of monitoring 
performance of in-service space diversity was carried out in 1972 in 
California and Florida. The monitoring was made possible by the 
development of Portable Propagation Recorders (PPR), the first
installation of which was at Brawley, California, in May 1972. 

Space diversity in the California case was installed by Pacific Tele- 
phone and Telegraph Co. to improve transmission availability of two 
hops (Salton-Brawley and Brawley-Glamis traversing Imperial Valley) 
on the Dallas-Los Angeles route. Extensive irrigation and high tem- 
peratures with little wind combine to create severe fading on the two 
hops in question. Measured and calculated values of $I_0$ are compared
in Table II (Fig. 12 is an example of measured data). RF threshold
switching was used, and the comparison was made at the threshold
value (≈ -35 dB relative to normal) where $I_t$ and $I_0$ are equal (Ref. 17). The
agreement of calculated and measured values is good, showing that the
equation for $I_0$ is applicable under diverse climatic conditions.

Two hops monitored in Florida (Andytown South-Andytown North 
and Andytown North-Okeelanta in Northern Everglades) were 
equipped with secondary receiving antennas and RF threshold switches 
as part of a program to improve transmission availability of hops that 
fade heavily. The measured results and the predictions for Andytown 


Table II— Comparison of calculated and measured values of $I_0$
(Salton-Brawley and Brawley-Glamis, California)*

                                  Relative       $I_0$ at 20 log $L$ = -35
Antenna    Frequency   Path       Gain of
Spacing    (GHz)       Length     Secondary      Calculated      Measured
(feet)                 (miles)    Antenna        from
                                  ($v^2$)        Equation (7)

30         4           42.6       0.41           7.6             8.0†
15         4           42.6       0.41           1.9             1.5‡
15         4           37.1       0.4            3.2             2.5§

* Antenna configurations were dictated by circumstances; generally, configurations
providing an improvement of at least 10 would be used.

† Brawley to Glamis, 7/7-8/14, 1972.

‡ Glamis to Brawley, 5/11-7/3, 1972.

§ Salton to Brawley, 8/17-8/29, 1972.


118 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1975 


[Fig. 12 plot annotations: Glamis-Brawley, California; July 7 to August 14, 1972; 42.6 miles, 4 GHz; $T_0$ = 38 days = 3.2 × 10^6 seconds; threshold: -35 dB; 10 log $v^2$ = -3.9. Vertical axis: time below level in seconds.]

Fig. 12—Threshold switching, Glamis to Brawley, California. Reception at (a)
Glamis, main antenna (unprotected), (b) Brawley, 15-foot separation, (c) Glamis,
30-foot separation.


South to Andytown North are summarized in Fig. 13 (Ref. 18). At -47 dB
(bottom level of the PPR for this case), the predicted time below level
in the 148-day test period was insignificant (only a few seconds), and 
none was measured. Diversity performance thus conformed to expecta- 
tions and the needed improvement in transmission availability was 
obtained. 


IX. PATHS WITH PASSIVE REPEATERS 


Use of passive repeaters (large reflectors of, say, 40 by 48 feet) on 
ridges or hilltops is sometimes dictated by terrain or by the unde- 
sirability of active repeaters in remote locations (power and mainte- 
nance-access problems). Tests on a Mountain Bell hop (Lusk-Wen- 
dover, Wyoming) in 1973 established that space diversity is operative 
through passive repeaters (Ref. 19). A design procedure for practical cases,
based on the Wyoming results, is: 


(i) Determine reflector size to obtain reasonable fade margin.

(ii) Estimate unprotected time below level ($T$) as the sum of the
values of $T$ for the individual legs (distances from reflector to
ends of hop).

(iii) Determine improvement needed to meet transmission availability
objectives.

(iv) If improvement is needed, determine vertical spacing based on
longest leg (use at least 20 or 30 feet); use this vertical spacing
at both ends of hop.

(v) If an antenna pair is in the near field of a reflector, ensure that
projected reflector height is not less than the distance from the
bottom edge of the bottom antenna to the top edge of the top
antenna.


[Fig. 13 plot annotations: Andytown North from Andytown South, Florida; July 3, 1972 to Dec. 4, 1972; $T_0$ = 1.28 × 10^7 seconds = 148 days; $f$ = 3.910 GHz; $D$ = 20.53 miles; $s$ = 25 feet; 10 log $v^2$ = -3.7; threshold at -44 dB; $I_0$ = 90. Axes: time below level in seconds versus level in dB relative to normal, 20 log $L$ (-25 to -50).]

Fig. 13—Threshold switch operation at Andytown North, Florida.


The experimental data obtained at Wendover and Lusk are sum- 
marized in Figs. 14 and 15 (the parameters of the experiment are sum- 
marized in Table III). Time-below-level differences for individual 
antennas in a pair arise from gain differences in instrumentation 
chains; curves fitted to the points have the standard decade of time per 
10-dB multipath slope in the deep-fade region. The curves fitted to the 
simultaneous fading points have the standard decade of time per 5-dB 
slope in the deep-fade region. Space-diversity reception at Lusk is
effective (Fig. 15), which is a result that could not be predicted using 
present methods. Successful space-diversity operation at Wendover 
could be predicted beforehand by viewing the transmitting antenna at 
Lusk as a feed illuminating a large aperture (the double reflectors), 
since little fading would be generated on the short Lusk-to-reflectors 
leg. 

The measured improvements at -35 dB, 50 at Wendover and 30 at
Lusk, are larger than the calculated values of 40 and 16, respectively.
The difference arises partly because use of the longest leg in the calculations
is a simple approximation of a complex situation and partly
because some scattering of measured points around the averages predicted
by the equation for $I_0$ can be expected.

[Fig. 14 plot annotations: Wendover, Wyoming; April 12 to July 10, 1973; $T_0$ = 89 days = 7.69 × 10^6 seconds; $s$ = 24 feet; $f$ = 6049 MHz. Axes: time below level in seconds versus level in dB relative to main antenna normal, 20 log $L$.]

Fig. 14—Space diversity on path containing passive repeater; reception at
Wendover, Wyoming.

[Fig. 15 plot annotations: Lusk, Wyoming; April 12 to July 13, 1973; $T_0$ = 92 days = 7.95 × 10^6 seconds; $s$ = 24 feet; $f$ = 6301 MHz. Axes: time below level in seconds versus level in dB relative to main antenna normal, 20 log $L$.]

Fig. 15—Space diversity on path containing passive repeater; reception at Lusk,
Wyoming.

Table III— Parameters of Lusk Radio test—Lusk to
Wendover, Wyoming

Reflectors                                      Two 40- by 48-ft reflectors
                                                separated by 138.5 ft

Path lengths from reflectors:
    To Wendover                                 31.5 miles
    To Lusk                                     9.5 miles

Antenna separations at Wendover and Lusk        24 ft (nominal) vertical center to
                                                center

Frequencies:
    Wendover                                    6049 MHz received
    Lusk                                        6301 MHz received

Calculations:

Values of $v^2$ arising from gain differences
in instrumentation chains:
    Wendover                                    $v^2$ = 1.6
    Lusk                                        $v^2$ = 0.63

Calculated improvement at -35 dB based on
31.5-mile path length:
    Wendover ($v^2$ = 1.6)                      $I_0$ = 40
    Lusk ($v^2$ = 0.63)                         $I_0$ = 16

Allocation to multipath fading from             1600 × 41/250 = 262 seconds
performance objectives for 41 miles             per year
(31.5 + 9.5)


The multipath objective for the Wendover-Lusk link is 262 seconds
per year one way (see Table III). Even allowing the annual time below
level to increase by a factor of two due to late summer and fall fading, 
the objective can be met comfortably with receiver fade margins in 
the 35- to 40-dB range. 
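The calculated improvements and the multipath allocation for the Wyoming test can be checked numerically. The sketch below assumes that eq. (7), which is not reproduced in this part of the paper, has the form $I_0 = 7.0 \times 10^{-5} f s^2 v^2 / (D L^2)$ with $f$ in GHz, $s$ in feet and $D$ in miles; this assumed form is consistent with the tabulated values of 40 and 16. The function name is hypothetical.

```python
def available_improvement(f_ghz, s_ft, d_miles, v2, level_db):
    """I_0 under the ASSUMED form of eq. (7): 7.0e-5 * f * s^2 * v^2 / (D * L^2)."""
    l_squared = 10 ** (level_db / 10)  # L^2, where level_db = 20 log L (negative in a fade)
    return 7.0e-5 * f_ghz * s_ft ** 2 * v2 / (d_miles * l_squared)


# Parameters from the Lusk-Wendover test; the longest leg (31.5 miles) is used.
wendover_i0 = available_improvement(6.049, 24, 31.5, 1.6, -35)
lusk_i0 = available_improvement(6.301, 24, 31.5, 0.63, -35)

# Allocation to multipath fading: 1600 seconds per year per 250 miles, one way.
objective_s = 1600 * (31.5 + 9.5) / 250

print(round(wendover_i0), round(lusk_i0), round(objective_s))
```

Under this assumption the two improvements come out near 39 and 16, and the allocation near 262 seconds per year.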


X. OVER-WATER PATHS 


Over-water paths, undesirable because of reflections, are sometimes 
unavoidable. When the over-water area is too large to permit shifting 
the reflection point off it via a high-low antenna combination, space 
diversity can be used to reduce reflection fading. A procedure for 
determining antenna spacing is discussed in terms of the geometry 
shown in Fig. 16, in which the reference plane coincides with the surface
of the water during “flat-earth” propagation conditions (i.e.,
those obtaining when K, the ratio of equivalent to actual earth radius, 
is infinity). The “bulge” height h of the surface reflection point A is 


    $h = d_1 d_2 / 1.5K$,   (14)


where $h$ is in feet when $d_1$ and $d_2$ are in miles. If the height $h_2$ of the
receiving point were changed, then the relative phase of the rays
arriving via the paths TR and TAR would also change. For a fixed
transmitting antenna height $h_1$, the increment in $h_2$ for a change from
an in-phase to an out-of-phase condition (half interference fringe
spacing) is approximately (for large $K$), in feet,


    $\Delta h_2 \approx \frac{1300 D}{f (h_1 - h)}$,   (15)


where $h_1$ and $h$ are in feet, $D$ is the path length in miles, and $f$ is the
frequency in GHz.

Fig. 17—Lake surface at various $K$ values in over-water path example ($N$ denotes
boundary of $N$th Fresnel zone at a frequency of 4 GHz).

Fig. 18—Values of $K$ at which lake surface is tangent to even Fresnel zone
boundaries at a frequency of 4 GHz.

The desired antenna separation for good diversity, therefore, would
be equal to $\Delta h_2$, but this is possible for one value of $K$ only, since $\Delta h_2$
increases as $h$ increases (an increase of $h$ results from a decrease of $K$).
For protection over a large range of $K$, a suitable initial choice for the
antenna separation is

    $s = \frac{1300 D}{f h_1}$,   (16)


which corresponds to $\Delta h_2$ for $K = \infty$. The appropriateness of this
choice must be verified. In a recently examined situation (shown in
Fig. 17) the path length $D$ is 27.8 miles, the transmitting antenna
height $h_1$ is 307 feet, and the receiving antenna height $h_2$ is 255 feet;
since the planned hop is on the Gulf coast, the antennas are placed
high to avoid obstruction fading. Protection from fading created by
reflections from the lake surface is desired for values of K ranging from 
4 to about —4 (Fig. 17 and the subsequent drawings were generated 
very easily on an xy-plotter driven by a programmable desk calculator). 
Values of K for which the lake surface is tangent to even Fresnel zone 
boundaries (determined by trial and error) are shown in Fig. 18; at 
these values of K, signal minima occur at the 255-foot height. For 
4-GHz transmission, a first choice for diversity antenna separation
would place the second receiving antenna at the 225-foot height
[(1300 × 27.8)/(4 × 307) ≈ 30; 255 - 30 = 225]. Simultaneous signal
minima will not occur on the two antennas because even Fresnel
zone boundaries for the 225-foot height are not tangent to the lake
surface at $K$ values that correspond to signal minima on the top receiving
antenna (see Fig. 19). The question of optimization now arises.
It would be desirable to place the bottom antenna somewhat lower to
move the boundary of $F_2$ down, further away from the lake surface
at $K = 0.88$; however, this would bring the boundary of $F_{12}$ closer to the lake
surface at $K = -1.7$ and, therefore, is not desirable. If the design
requirements were relaxed to cover values of $K$ between 4/3 and $\infty$ only,
then the bottom antenna could be brought further down to make the
separation closer to the value of $\Delta h_2$ at $K = 4/3$.

Fig. 19—Even Fresnel zone boundaries at 225-foot height for bottom receiving
antenna at a frequency of 4 GHz.

A possible alternative location of the bottom antenna, foreground
clearance permitting, is the 85-foot height (see Fig. 20). This has some
of the properties of a high-low shot, and the even Fresnel zone boundaries
are well placed.

Fig. 20—Even Fresnel zone boundaries at 85-foot height for bottom receiving
antenna at a frequency of 4 GHz.

A final observation on space-diversity design for protection from 
reflections is that the antenna spacings often are suitable, fortunately, 
for protection from atmospheric multipath fading. 
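The first-choice antenna separation in the over-water example follows directly from eqs. (14) through (16). The Python sketch below uses the constant 1300 of eq. (15); the function names are hypothetical, and the mid-path reflection point used in the last lines is an illustrative assumption, not a parameter of the example.

```python
def bulge_height_ft(d1_miles, d2_miles, k):
    """Eq. (14): bulge height h in feet at the reflection point."""
    return d1_miles * d2_miles / (1.5 * k)


def half_fringe_spacing_ft(path_miles, f_ghz, h1_ft, bulge_ft=0.0):
    """Eq. (15); with bulge_ft = 0 (K infinite) this reduces to eq. (16)."""
    return 1300.0 * path_miles / (f_ghz * (h1_ft - bulge_ft))


# Example from the text: D = 27.8 miles, f = 4 GHz, h1 = 307 ft, h2 = 255 ft.
s = half_fringe_spacing_ft(27.8, 4.0, 307.0)
bottom_antenna_height = 255.0 - round(s, -1)  # the text rounds s to about 30 ft

# Hypothetical mid-path reflection point, to show that the spacing grows as K decreases.
h_at_four_thirds = bulge_height_ft(13.9, 13.9, 4.0 / 3.0)
s_at_four_thirds = half_fringe_spacing_ft(27.8, 4.0, 307.0, h_at_four_thirds)
```

The first result is about 29.4 feet, which the text rounds to 30 feet to obtain the 225-foot height for the bottom antenna.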


XI. CLEARANCE RULES 


Operating experience indicates that propagation conditions in 
various sections of the United States can be classified, in broad terms, 
as good, average, or difficult as shown on the map in Fig. 21. Desired 
clearances for transmission between main antennas are summarized 
in Table IV. In addition, hop lengths in coastal areas of the southern 
United States are sometimes restricted to 20 miles to reduce obstruc- 
tion fading. 

Clearance requirements for antennas added to provide space-diversity
protection from multipath fading are less stringent: 0.6 $F_1$
at $K = 4/3$, with a foreground clearance of 10 feet in the first 500 feet
from the antenna. In many cases, this permits placement of the sec- 
ondary antenna below the main antenna. 

It appears (from recent results discussed in Appendix A) that it
may be possible to relax the secondary-antenna clearance requirements;
however, results of further tests (now in progress) must be
evaluated before general recommendations on this can be made.

Fig. 21—Geographic occurrence of good, average, and difficult propagation
conditions.

Table IV — Clearance rules for top antennas

Propagation
Conditions      Clearance

Good            0.6 $F_1$ at $K = 1$ (but not less than grazing at $K = 2/3$)
Average         The larger of 0.3 $F_1$ at $K = 2/3$ and $F_1$ at $K = 4/3$
Difficult       Grazing at $K = 1/2$


XII. ACKNOWLEDGMENTS


We are indebted to W. T. Barnett and E. E. Muller for useful discussions.
The Multiple Input Data Acquisition System (MIDAS) and
Portable Propagation Recorder (PPR) were designed by G. A.
Zimmerman.


APPENDIX A 
Effects of Reduced Clearance 


The purpose of the bottom antenna in a space-diversity pair is to 
provide protection from multipath fading; its clearance therefore need 
not be based on requirements for protection from obstruction fading. 
Placement of secondary antennas lower than currently permitted would 
reduce tower costs because of reduced wind loading. 

An investigation of multipath fading as a function of antenna height 
was undertaken at Palmetto, Georgia in 1972 at 4.198 GHz on a 26.4-mile
path from Atlanta, Georgia (Ref. 20). The path profile (Fig. 22; arrows
denote trees) shows that the controlling obstruction is located some 6
miles from the receiving antennas at Palmetto. Lines of sight that
graze the obstruction are shown in Fig. 22 for three kinds of atmospheric
conditions. Under normal conditions the gradient of the index
of refraction is -39 N units per kilometer,* and the ratio $K$ of the
equivalent-to-actual earth radius is 4/3. When the gradient becomes
-157 N units per kilometer, the so-called flat-earth condition exists
($K = \infty$) and clearance is substantial. For changes of the gradient in
the opposite direction, clearance becomes reduced; in path design the
value of interest is $K = 2/3$, which corresponds to a positive gradient of
79 N units per kilometer.

The projections of the grazing lines on the Palmetto tower are given
by the tops of the three bars on the right-hand side of Fig. 23, at 30,
110, and 185 feet from ground. The center line of the horn-reflector
receiving antenna is 332.5 feet from ground. The center lines of the
antennas added for test purposes are labeled D6 through D9. The
antenna at D7 was a small 15-dB-gain horn; the other three were 4-foot-diameter
parabolic reflectors. The normal antenna height for reception
from Atlanta would be at about D6; the tower is much higher
because clearance on another path controls its height.


* The radio refractive index of air, $n$, is frequently expressed as $n = 1 + N \times 10^{-6}$,
where $N$ describes the refractive index in “N units.”

Fig. 23—Receiving antennas at Palmetto, Georgia in 1972.

The antenna of interest is the bottom 4-foot parabolic reflector at
D9, with center line 95 feet above ground. The center line is 15 feet
below the point where the grazing line under normal conditions
($K = 4/3$) projects on the tower. The clearance in Fresnel zones is
about -0.2 $F_1$. The center line of the bottom antenna falls on the
grazing line when $K$ is about 1.72, which corresponds to an index of
refraction gradient of about -66 N units per kilometer, which is 27 N
units per kilometer more negative than the gradient of -39 N units per
kilometer under normal $K = 4/3$ conditions.
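The pairs of refractivity gradient and $K$ quoted in this appendix are consistent with the standard effective-earth-radius relation $K = 157/(157 + dN/dh)$, with the gradient in N units per kilometer. That relation is not stated in the text and is introduced here as an assumption; the function name is hypothetical. A short Python check:

```python
def k_factor(gradient_n_per_km):
    """Effective earth-radius factor K for a refractivity gradient in N units/km.

    Uses the standard relation K = 157 / (157 + dN/dh), an assumption that is
    not spelled out in the text but reproduces the values quoted there.
    """
    denominator = 157.0 + gradient_n_per_km
    if denominator == 0.0:
        return float("inf")  # flat-earth condition
    return 157.0 / denominator


for gradient in (-39.0, -157.0, 79.0, -66.0):
    print(gradient, k_factor(gradient))
```

The four gradients give values close to 4/3, infinity, 2/3 and 1.72, respectively.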

The below-grazing location of the bottom antenna was further veri- 
fied by noting that there was a signal loss of 14 dB when it was moved 
from its previous (higher) location on March 14, 1972. A theoretical loss 
of about 8 dB would occur if the obstruction were a knife edge; diffrac- 
tion over a tree-covered hilltop accounts for the additional 6-dB loss. 

Data were analyzed on received power (recorded by MIDAS, Ref. 21)
during time intervals containing deep fading (more than 20 dB) on 
any of the five receiving antennas in April, May, and June of 1972. 
The total of such time intervals was about 13 hours during which the 
amount of fading was similar for the top four antennas. The a priori 
expectation was that the bottom antenna would fade more, since its 
signal level during normal daytime conditions was 14 dB low because 
of a lack of clearance. The opposite occurred: the signal received by 
the bottom antenna faded less than signals received by the four upper 
antennas. 

The measured time-below-level data (during the 13 hours of deep- 
fading activity) for the bottom antenna (D9) and the one above it 
(D8) are shown in Fig. 24 (fading on the other antennas, after adjust- 
ment for gain differences, was the same as that on D8). The single 
antenna curves have the normal slope (decade of time per 10 dB) for 
deep multipath fades. Time below level for the bottom antenna is 
smaller than that for D8 by about a factor of three in the deep-fade 
region. 

Diversity performance of the combination of D8 and D9 was ex- 
cellent. Simultaneous fades on the two antennas were never deeper 
than about 18 dB from normal (see Fig. 24). 

These results relate to aspects of radio propagation not covered by
previous experience or theory. Physically, the observations can perhaps
be explained by the presence of layers with strong negative
gradients in the index of refraction that create fading but offset the
normal lack of clearance for the bottom antenna. Furthermore, since
the bottom antenna is relatively close to the ground, the terrain blocks
some of the potentially interfering rays, which would also tend to
reduce the amount of fading.

Fig. 24—Fading measured at Palmetto, Georgia.

A tentative conclusion is that protection from multipath fading can 
be obtained by placing the bottom antenna somewhat below grazing 
under normal atmospheric conditions (diffraction loss of perhaps 10 
dB). It must be established that antennas placed in such a manner 
perform well consistently; results from tests at Culver, Indiana 
(1973-1974) are encouraging. 


APPENDIX B 


Example of Roughness Calculation 


The terrain heights (denoted by $x_i$, terminal ends omitted) for the
19-mile path in the example of Fig. 25 are provided in Table V.
The roughness (standard deviation) is the square root of the average
square of the deviation from the mean:

    $w^2 = \frac{1}{18}\sum_{i=1}^{18}(x_i - M)^2 = \left[\frac{1}{18}\sum_{i=1}^{18} x_i^2\right] - M^2$   (17)

    $M = \frac{1}{18}\sum_{i=1}^{18} x_i = 8550/18 = 475$   (18)

    $w = \sqrt{(4133950/18) - (475)^2} = 63.5$ feet.   (19)

Fig. 25—Determination of terrain roughness.


Table V — Example of terrain heights for 19-mile path
(see Fig. 25)

 i    x_i (ft)      i    x_i (ft)      i    x_i (ft)
 1    600           7    400          13    450
 2    625           8    420          14    420
 3    515           9    460          15    390
 4    440          10    420          16    480
 5    480          11    450          17    520
 6    450          12    480          18    550
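The roughness calculation can be reproduced from the heights in Table V. The following Python sketch uses the population standard deviation, which matches the definition in eq. (17); the variable names are illustrative.

```python
import statistics

# Terrain heights x_i in feet from Table V (terminal ends omitted).
heights_ft = [600, 625, 515, 440, 480, 450,
              400, 420, 460, 420, 450, 480,
              450, 420, 390, 480, 520, 550]

mean_height = statistics.fmean(heights_ft)         # M in eq. (18)
sum_of_squares = sum(x * x for x in heights_ft)    # numerator inside eq. (19)
roughness_ft = statistics.pstdev(heights_ft)       # w, population standard deviation

print(mean_height, sum_of_squares, round(roughness_ft, 1))
```

The mean is 475 feet, the sum of squares is 4,133,950, and the roughness is about 63.5 feet, in agreement with eqs. (18) and (19).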


APPENDIX C 
Length of Fading Season 


Multipath fading is a warm-weather phenomenon. Assuming that
the length of the warm portion of the year is proportional to the average
annual temperature, the value of $T_0$ to be used in eq. (2) to estimate
the annual time below level is

    $T_0 = (t/50)\, 8 \times 10^6$ sec, $35 \leq t \leq 75$,   (20)


where $t$ denotes the average annual temperature of the locality in
question in °F as determined from Fig. 26.

(Reprinted from the 1963 Weather Handbook, p. 194, by permission of the publisher, Conway Research, Inc., Atlanta, Georgia. No further reproduction is authorized.)
Fig. 26—Average annual temperatures in °F.

In the average case (Figs. 5 and 8), $T_0$ is equal to the number of
seconds in three months; eq. (20) reduces to this when $t$ is 50°F. The
temperature contours in Fig. 26 show that 50°F is appropriate for
middle latitudes in the United States and for Ohio in particular; data
from Ohio have often been used to describe average fading.

The range of average annual temperatures (Fig. 26) is from 35°F
in northern North Dakota to 75°F in southern Florida. The corresponding
fading season lengths range from 70 percent to 150 percent
of average ($T_0$ ranges from 2 months to 4.5 months).
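Equation (20) is simple enough to tabulate directly. The Python sketch below evaluates it at the ends of the stated temperature range and at the 70°F value used later in the sample calculations; the function name and the length assumed for a month (one-twelfth of a 365.25-day year) are illustrative choices.

```python
SECONDS_PER_MONTH = 365.25 * 86400 / 12  # assumed average month length


def fading_season_seconds(avg_annual_temp_f):
    """Eq. (20): T_0 = (t/50) * 8e6 seconds, valid for 35 <= t <= 75 deg F."""
    if not 35 <= avg_annual_temp_f <= 75:
        raise ValueError("eq. (20) is stated only for 35 <= t <= 75 deg F")
    return (avg_annual_temp_f / 50.0) * 8.0e6


for t in (35, 50, 70, 75):
    t0 = fading_season_seconds(t)
    print(t, t0, round(t0 / SECONDS_PER_MONTH, 1))
```

The results are roughly 2.1, 3.0, 4.3 and 4.6 months, which is in line with the quoted range of about 2 to 4.5 months.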


APPENDIX D 
Addition of Space Diversity to Frequency Diversity 


In the Bell System, fully loaded long-haul routes with radio channels 
in both the 4- and 6-GHz bands will utilize cross-band frequency- 
diversity protection with two protection channels—18 working chan- 
nels in a 2 X 18 system.“ Expectations are that space diversity in 
2 X 18 protection systems will be needed only when problem hops 
are encountered—exceptionally high fading activity or unavoidable 
ground reflections—that will have to be treated on an individual basis. 
The need for space-diversity protection will arise on long-haul routes 
that have radio channels in one frequency band only, 4 or 6 GHz; the 
single frequency-diversity channel permitted on such routes! may not 
provide adequate protection in geographic areas where fading activity 
is above average. 

The time below level at the fade margin of an average working
channel in a frequency-diversity system with one protection channel
and $m$ working channels (a 1 × $m$ system) is approximately equal to
the simultaneous time below level of channels in an equivalent 1 × 1
system in which the separation of the center frequencies of the two
channels is


    $\Delta f_{eq} = m \Big/ \left[\sum (1/\Delta f_k)\right]$,   (21)


where the sum contains $\frac{1}{2}m(m + 1)$ terms, extending the summation
over frequency separations of all channel pairs in the 1 × $m$ system.
The equivalent 1 × 1 system (introduced to simplify calculations)
is derived from the first term in the alternating series describing transmission
unavailability (Ref. 1) under the assumption that equipment works
perfectly during fading (equipment failures are accounted for via the
allocation of transmission availability objectives).

The simultaneous time below level of the two channels in the
equivalent 1 × 1 system is


    $T_{eq} = T / I_{eq}$,   (22)

where $T$ is the time below level in an unprotected channel and

    $I_{eq} = q_{eq} L^{-2}$,   (23)


with $q_{eq}$ determined by using $\Delta f_{eq}$ in eq. (10). As an example, in a
1 × 3 system with 60 MHz between channels


    $\Delta f_{eq} = \frac{3}{(3/60) + (2/120) + (1/180)} = 41.5$ MHz.   (24)


Similarly, in a 1 × 7 system with 30 MHz between channels,

    $\Delta f_{eq} = 15.3$ MHz.   (25)


The use of these values of $\Delta f_{eq}$ to calculate $T_{eq}$ for one hop is illustrated
in the example in Table VI and Fig. 27. In a multihop switching section,
$T_{eq}$ would be determined by adding values of $T_{eq}$ determined for each
hop.
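The equivalent separation of eq. (21) can be computed by enumerating all channel pairs. The Python sketch below assumes equally spaced channels (one protection channel plus $m$ working channels); the function name is hypothetical.

```python
from itertools import combinations


def equivalent_separation_mhz(m, spacing_mhz):
    """Eq. (21) for a 1 x m system with equally spaced channels (an assumption)."""
    # m working channels plus one protection channel, spaced spacing_mhz apart.
    center_frequencies = [k * spacing_mhz for k in range(m + 1)]
    pairs = list(combinations(center_frequencies, 2))
    assert len(pairs) == m * (m + 1) // 2  # the sum contains m(m+1)/2 terms
    inverse_sum = sum(1.0 / abs(a - b) for a, b in pairs)
    return m / inverse_sum


one_by_three = equivalent_separation_mhz(3, 60.0)
one_by_seven = equivalent_separation_mhz(7, 30.0)
print(round(one_by_three, 1), round(one_by_seven, 1))
```

The two cases give about 41.5 MHz and about 15.3 MHz, the latter being the value used in the sample calculations of Table VI.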


[Fig. 27 plot annotations: curves of $T_{eq}$ for 1 × 7 and $T_{eq}$ for 1 × 3, with the objective marked. Axes: annual time below level in seconds versus signal level, dB relative to normal, 20 log $L$ (-20 to -40).]

Fig. 27—Example of need for space-diversity protection (see Table VI for path
parameters and calculations).

Table VI— Sample calculations

Path parameters                     $D$ = 20 miles (southern coastal area)
                                    $f$ = 6 GHz
                                    $w$ = 20 feet
                                    $t$ = 70°F

Fading in an unprotected channel    $c = 2(w/50)^{-1.3} = 6.58$
                                    $r = c(f/4)D^3 10^{-5} = 0.79$
                                    $T_0 = (t/50)\, 8 \times 10^6 = 1.12 \times 10^7$ seconds
                                    $T = rT_0L^2 = 8.8 \times 10^6 L^2$ seconds/year

Long-haul objective                 1600 × $D$/2000 = 16 seconds/year

Equivalent 1 × 1 system             $\Delta f_{eq}$ = 41.5 MHz
replacing 1 × 3                     $q_{eq} = (50/(6 \times 20)) \times (41.5/6000) = 0.0029$
                                    $I_{eq} = 0.0029 L^{-2}$

Equivalent 1 × 1 system             $\Delta f_{eq}$ = 15.3 MHz
replacing 1 × 7                     $q_{eq} = (50/(6 \times 20)) \times (15.3/6000) = 0.0011$
                                    $I_{eq} = 0.0011 L^{-2}$


The long-haul objective (using $D_s$ to denote the length of the switching
section) is

    $T_{ob} = 1600 D_s / 2000$ sec,   (26)


in terms of which the recommendations for the application of space
diversity are:

    $T_{eq} < T_{ob}$, space diversity not needed,   (27)
    $T_{eq} > 1.5\, T_{ob}$, space diversity needed.   (28)


Use of space diversity is optional in the tolerance band ($T_{ob} \leq T_{eq}
\leq 1.5\, T_{ob}$), which reflects parameter uncertainties and the fact that
the approximation used to determine $T_{eq}$ overestimates the unavailability.
According to the above criteria, space diversity is needed in
the example in Fig. 27 (assuming a one-hop switching section with
40-dB fade margin).

If space-diversity protection is needed, then it should first be applied
to the worst hop in a switching section. In subsequent calculations, the
time below level with space diversity from eq. (6) should replace $T_{eq}$
for space-diversity-equipped hops, since improvement due to space
diversity is normally dominant.
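The decision rule of eqs. (27) and (28) can be applied to the sample path of Table VI. The Python sketch below is an illustration only: the expression used for $q_{eq}$, $(50/fD)(\Delta f_{eq}/f)$ with $f$ in GHz on the left and in MHz inside the ratio, is inferred from the numbers in Table VI because eq. (10) is not reproduced in this part of the paper, and all function names are hypothetical.

```python
def recommendation(t_eq_s, t_objective_s):
    """Eqs. (27) and (28), with the tolerance band treated as optional."""
    if t_eq_s < t_objective_s:
        return "not needed"
    if t_eq_s > 1.5 * t_objective_s:
        return "needed"
    return "optional"


# Path parameters from Table VI.
D_MILES, F_GHZ, W_FT, T_F = 20.0, 6.0, 20.0, 70.0

c = 2.0 * (W_FT / 50.0) ** -1.3            # terrain factor
r = c * (F_GHZ / 4.0) * D_MILES ** 3 * 1e-5
t0 = (T_F / 50.0) * 8.0e6                  # eq. (20)


def t_eq_seconds(delta_f_eq_mhz, fade_margin_db):
    """Annual T_eq for one hop at the fade margin, per eqs. (22) and (23)."""
    l_squared = 10 ** (-fade_margin_db / 10)
    t_unprotected = r * t0 * l_squared
    # ASSUMED form of q_eq, inferred from Table VI.
    q_eq = (50.0 / (F_GHZ * D_MILES)) * (delta_f_eq_mhz / (F_GHZ * 1000.0))
    i_eq = q_eq / l_squared
    return t_unprotected / i_eq


objective_s = 1600.0 * D_MILES / 2000.0    # eq. (26) for a one-hop section
t_eq_1x3 = t_eq_seconds(41.5, 40.0)
t_eq_1x7 = t_eq_seconds(15.3, 40.0)
print(round(t_eq_1x3), round(t_eq_1x7), recommendation(t_eq_1x3, objective_s))
```

Under these assumptions the 1 × 3 case gives roughly 31 seconds per year against an objective of 16 seconds, so the rule returns "needed", which agrees with the conclusion drawn from Fig. 27.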


APPENDIX E 
Comparative Switching With Hysteresis 


The analysis is carried out in terms of the envelope voltages ($R_1$ and
$R_2$) of the signals from the two receiving antennas. The antennas are
assumed to be of the same size and $R_1$ and $R_2$ are normalized to be
unity in the absence of fading. The envelope $R$ of the diversity signal
is a composite of $R_1$ and $R_2$ obtained by switching; $R$ becomes $R_1$ when
$R_1 > bR_2$ and $R$ becomes $R_2$ when $R_2 > bR_1$. The parameter $b$ ($b \geq 1$)
describes the hysteresis of the switch.

The probability of a fade of 20 log $L$ dB is the probability that
$R < L$. Regions in the $R_1, R_2$-plane over which the joint probability
density function $p(R_1, R_2)$ has to be integrated to obtain this probability
are shown in Fig. 28. This shows that


    $\Pr(R < L) = \int_0^L dR_2 \int_0^L dR_1\, p(R_1, R_2)$
        $+ \frac{1}{2} \iint dR_1\, dR_2\, p(R_1, R_2)$ over triangle 1,
        $+ \frac{1}{2} \iint dR_1\, dR_2\, p(R_1, R_2)$ over triangle 2.   (29)

For deep fades and antennas of equal size (Ref. 9),

    $p(R_1, R_2) = 4 q^{-1} R_1 R_2$.   (30)

Substitution and integration gives

    $\Pr(R < L) = \frac{1}{2}(b^2 + b^{-2})\, q^{-1} L^4$.   (31)


In the absence of hysteresis, $b$ is unity. When hysteresis is introduced,
$b$ becomes larger than unity, and the probability of fading is increased
by a factor of $0.5(b^2 + b^{-2})$.
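The hysteresis penalty can be checked by integrating the deep-fade density of eq. (30) numerically over the regions described above. The Python sketch below assumes that, in the wedge between the two switching lines, the diversity signal is on either antenna half the time, and it compares the result with the closed-form factor $0.5(b^2 + b^{-2})$; the function names and the values of $L$, $b$ and $q$ are illustrative.

```python
def hysteresis_factor(b):
    """Closed-form increase in fade probability, 0.5 * (b^2 + b^-2)."""
    return 0.5 * (b ** 2 + b ** -2)


def below_level_probability(level, b, q, n=400):
    """Midpoint-rule integral of p(R1, R2) = 4*R1*R2/q weighted by time with R < L."""
    upper = b * level  # no contribution beyond b*L on either axis
    h = upper / n
    total = 0.0
    for i in range(n):
        r1 = (i + 0.5) * h
        for j in range(n):
            r2 = (j + 0.5) * h
            if r1 > b * r2:      # switch rests on antenna 1
                fraction = 1.0 if r1 < level else 0.0
            elif r2 > b * r1:    # switch rests on antenna 2
                fraction = 1.0 if r2 < level else 0.0
            else:                # hysteresis wedge: half the time on each antenna
                fraction = 0.5 * (r1 < level) + 0.5 * (r2 < level)
            total += fraction * 4.0 * r1 * r2 / q * h * h
    return total


L, B, Q = 0.01, 2.0, 50.0  # illustrative values only
numeric = below_level_probability(L, B, Q)
closed_form = hysteresis_factor(B) * L ** 4 / Q
```

For $b = 1$ the factor is unity, and for $b = 2$ it is 2.125; the numerical integral agrees with the closed form to well within one percent.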


[Fig. 28 annotation: in the region between $R_2 = bR_1$ and $R_1 = bR_2$, a suitable approximation is that half the time $R$ is $R_1$ and half the time $R$ is $R_2$.]

Fig. 28—Comparative switching with hysteresis. (a) Switching diagram. (b)
Areas in which $R < L$.


APPENDIX F
Threshold Switching Calculations 


Below threshold the switch cycles, and $R$ is $R_1$ or $R_2$ in equal amounts
of time; the below-level probability for below-threshold operation is,
with $A$ denoting the threshold value,


    $\Pr(R < L) = \frac{1}{2}\Pr(R_1 < L, R_2 < A) + \frac{1}{2}\Pr(R_1 < A, R_2 < L)$, $L < A$.   (32)


Because of symmetry in the joint probability density function, this
simplifies to


    $\Pr(R < L) = \int_0^L dR_1 \int_0^A dR_2\, p(R_1, R_2)$, $L < A$   (33)


for antennas of equal size. The below-level probability in above-threshold
operation is (invoking symmetry as above) the sum of two
terms. The first term describes the situation when $R$ is $R_1$, the second
when $R$ has become $R_2$ upon $R_1$ dropping below threshold:


    $\Pr(R < L) = \Pr(A < R_1 < L) + \Pr(R_1 < A, R_2 < L)$, $L > A$.   (34)


These equations are based on results obtained for correlated Rayleigh-distributed
variables (Ref. 2). Use of the probability density function from
eq. (30) in (33) gives a deep fade approximation:


    $\Pr(R < L) \approx q^{-1} A^2 L^2$, $L < A$,   (35)


which shows the $L^2$ behavior below threshold.
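The deep-fade approximation of eq. (35) explains why the improvement is constant below threshold. The Python sketch below assumes that the unprotected below-level probability for the normalized signal is approximately $L^2$ in deep fades (an assumption consistent with $I_0 = qL^{-2}$, not a statement from this appendix), and it picks $q$ so that the improvement at a -35 dB threshold is about 30, as in the example of Fig. 10. The function names are hypothetical.

```python
def below_threshold_probability(level_db, threshold_db, q):
    """Eq. (35): Pr(R < L) ~ A^2 * L^2 / q, valid for L < A."""
    if level_db >= threshold_db:
        raise ValueError("eq. (35) applies only below threshold (L < A)")
    l_squared = 10 ** (level_db / 10)
    a_squared = 10 ** (threshold_db / 10)
    return a_squared * l_squared / q


def improvement_below_threshold(level_db, threshold_db, q):
    """Ratio of unprotected (assumed ~ L^2) to protected below-level probability."""
    unprotected = 10 ** (level_db / 10)
    return unprotected / below_threshold_probability(level_db, threshold_db, q)


THRESHOLD_DB = -35.0
Q = 30.0 * 10 ** (THRESHOLD_DB / 10)  # illustrative: gives I_0 = 30 at the threshold

for level_db in (-36.0, -40.0, -45.0):
    print(level_db, improvement_below_threshold(level_db, THRESHOLD_DB, Q))
```

The ratio is the same at every level below threshold and equals $q/A^2$, which is the value of $I_0$ at the threshold.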

The curves in Figs. 10 and 11 were calculated (as outlined above)
for correlated Rayleigh-distributed variables with $q$ determined from
the empirical expression for $I_0$ (machine computation was necessary
because the integrals cannot be evaluated in closed form). The Rayleigh 
distribution was used in the theoretical calculations because its previous 
uses have provided insights valuable to the statistical description of 
space-diversity operation. The theoretically predicted functional forms 
have been verified by using actual fading records as inputs to com- 
puter-simulated threshold switches.” 


APPENDIX G 
Common Control Switching 


Space-diversity switching is normally implemented on a per-channel
basis: each radio channel has its own switch, and switching decisions
for a channel are based, in FM systems, on carrier levels received on
the upper and lower antennas. Use of the carrier in one channel (controlling
channel) to control switching of other channels (controlled
channels) would reduce costs. For example, diversity transmission
could be controlled by the received carrier in another channel, thus
eliminating the feedback link; similarly, a single broadband switch
could be used to switch a number of radio channels. Unfortunately,
the improvement obtained in such common control schemes is too
small to be of practical value.

Fig. 29—Improvement in a controlled channel as a function of frequency separation
from controlling channel (4-GHz band, 30-foot antenna separation, 26-mile path,
40-dB fade margin).

The available improvement $I_{cc}$ in a controlled channel depends on
$\Delta f$, the frequency separation of the carriers in the controlling and
controlled channels. When $\Delta f$ is zero, $I_{cc}$ is equal to the available
improvement $I_0$ in the controlling channel, with $I_0$ given by (7). In the limit,
as $\Delta f$ increases and becomes large, $I_{cc}$ approaches unity as fading at
the two frequencies becomes independent. Theoretical estimates*® show 
that the initial decrease in $I_{cc}$ as $\Delta f$ increases from zero is rapid. As
an example, the approximate behavior of $I_{cc}$ as a function of $\Delta f$ in the
4-GHz band is shown in Fig. 29 for a 30-foot antenna separation on a
26-mile path and with a fade margin of 40 dB. The value of $I_{cc}$ at
$\Delta f = 0$ is about 100. At one channel separation, $\Delta f = 20$ MHz, $I_{cc}$
has decreased to about 3. Clearly, the improvement in the controlled
channels is too small to be of practical value. The lack of performance 
in common control switching is simply a consequence of the fact that 
deep multipath fades are highly frequency selective. 
APPENDIX H


Results of Palmetto, Georgia, Experiments in 1968 


Power received at 4.198 GHz on five vertically separated antennas 
(Fig. 30) was measured on a 24.6-mile path from Atlanta to Palmetto, 
Georgia over a 72-day period in 1968 (August 16 to October 27). Five
inputs of MIDAS²¹ (Multiple Input Data Acquisition System) were
used to acquire the data. A power reading from the horn reflector on
top was obtained first; about 2 milliseconds later, a reading from the 
top dish was obtained, and so on down the tower. This scanning was 
repeated at a rate of five times per second. The readings were con- 
verted to a logarithmic scale and recorded in digital form (nominally 
1 dB quantizing) on magnetic tape for subsequent computer processing. 
In the absence of fading, to conserve tape, the recording rate was less 
than the sampling rate. 

Improvement as a function of fade depth for the ten antenna pairs,
with separations ranging from 4 to 60 feet center-to-center, was
obtained by dividing, at a given fade depth, the measured time below
level for an antenna ($T$) by the measured simultaneous time below
level ($T_s$). The equation for improvement,

$$I_0 = q L^{-2}, \tag{36}$$

was fitted to the points in the deep-fade region to determine a value of
$q$ for each antenna separation ($v^2 = 1$, since gains were set to equalize
free-space levels). The values of $q$ so obtained are shown in Fig. 31
as a function of antenna separation; because of random differences in
$T$ from antenna to antenna, there are two values of $q$ for each antenna
separation, depending on whether $T$ for the upper or the lower antenna
was used to form the ratio $T/T_s$.

Fig. 30—Receiving antennas at Palmetto, Georgia in 1968. D1 to D4 are 4-foot-
diameter parabolic reflectors.

Fig. 31—Variation of correlation parameter q: (a) Georgia 1968, and (b) eq. (9).
Increasing values of q denote increasing decorrelation.

As a function of antenna separation, the correlation parameter q 
first increases as the square of the antenna separation, but then levels 
off and decreases (dotted line in Fig. 31); the presence of a small 
dominant component in the multipath spectrum is the most likely 
cause of this. Compared to eq. (9) (see also solid line in Fig. 31), q 
is enhanced for smaller separations, but is reduced for separations over 
about 50 feet. Dominant components being random on paths designed 
for negligible ground reflections, the enhancement cannot be counted 
upon when choosing an antenna separation; however, the possibility 
that $q$ may decrease for large separations must be provided for. This is
the reason why the applicability (for design purposes) of (7) has been 
limited to antenna separations of 50 feet or less. 
A New, Fast-Converging Mean-Square
Algorithm for Adaptive Equalizers With 
Partial-Response Signaling 


By K. H. MUELLER 


(Manuscript received December 5, 1972) 


A new, generalized mean-square algorithm is presented to adjust the 
taps of a transversal adaptive equalizer. Any knowledge of the channel or 
the signaling format can be taken into account and will speed up the con- 
vergence process. The main applications are seen in partial-response sig- 
naling, where the new algorithm eliminates the interaction between the 
individual tap increments. This is achieved by decorrelating the compo- 
nents of the gradient in a fixed weighting matrix prior to the adjustments. 
Convergence is then extremely fast. If the channel has only phase distor- 
tion, a single iteration is sufficient to obtain the optimum tap vector (for 
any timing and carrier phase). This is verified by computer simulations.
Finally, the new equalizer is compared with another structure recently 
proposed, and some advantages of the new system with regard to imple- 
mentation and flexibility are pointed out. 


1. INTRODUCTION 


Together with sophisticated modulation techniques and bandwidth- 
conserving signal designs, automatic transversal equalizers are the 
prime characteristics of the new generation of high-speed data modems 
that have appeared on the market in the past few years. These modems 
allow data rates in the range of 3600 to 9600 b/s to be transmitted 
over voice-band channels. In addition to point-to-point transmission 
between computers and data processing centers, a large demand for 
high-speed data sets exists in multiparty polling systems. Because a 
message in such a system may consist of only a few hundred bits, it is 
essential that the start-up time of the modem be very short. Prefer- 
ably, the modem start-up time should waste fewer bits than are 
contained in an average message to guarantee a reasonable system 
throughput. This would basically imply that start-up time should be 
inversely proportional to the data rate; this is a very contradictory 
requirement, since more accurate and complex operations are usually 
required during start-up of a high-speed modem than with a low-speed
version. Timing recovery, carrier recovery (if required), and training 
of the automatic equalizer are the most important start-up operations. 
The time that must be allowed for the equalizer training is usually 
the major delay in start-up; thus, it seems worthwhile to concentrate 
some of our efforts in this direction. 

During the past few years, much attention has been given to partial 
response signaling.'-* Some of these waveforms are particularly 
attractive for data transmission over band-limited channels, since they 
have a spectral zero at the Nyquist frequency. These signals can easily 
be generated with virtually zero excess bandwidth, and yet their time 
response decays faster than that of a sin(x)/x pulse. These advantages
result from a controlled amount of intersymbol interference in these 
signals, which increases the number of levels in the baseband eye 
pattern. In some formats, the impulse response has odd symmetry 
and the spectrum is zero at dc (which is of particular interest if SSB
modulation is used). If a traditional transversal automatic equalizer
with an MS (mean-square) gradient algorithm®.’ is used for partial 
response signals, the initial convergence will be very slow. Chang has 
shown that the spread of the eigenvalues of the signal autocorrelation 
matrix can become very large for Class IV partial response signals, 
and he proposed a new equalizer structure that eliminates this problem 
and thus can give fast convergence.’ Of course, with the appropriate 
modification of some parameters, his conclusions can be extended to 
other partial-response signals as well. 

In this paper, we propose an alternate solution to the problem of 
fast partial-response signal equalization. Instead of changing the 
equalizer structure, we propose to modify the tap updating algorithm. 
The coefficients associated with the new algorithm are given by a 
simple matrix inversion. 

The advantages of the new equalizer structure are not limited to 
partial-response signaling. Any knowledge of the channel or the 
signaling format can be used to speed up the convergence process. 
The classic transversal equalizer evolves as a special case (Nyquist
signaling) of the more general configuration that is proposed in this
paper.


II. A GENERAL MEAN-SQUARE ALGORITHM


Let us assume a transversal equalizer structure with N taps, with a
tap signal input vector $x_k$ and a tap coefficient vector $c$ at time $t_0 + kT$.
The equalizer output $y_k$ is given by

$$y_k = x_k^T c \tag{1}$$

and will not, in general, be exactly equal to the desired reference value
$d_k$, the error being

$$e_k = y_k - d_k. \tag{2}$$

The MS error that appears at the output will thus be

$$\varepsilon = E\{e_k^2\}, \tag{3}$$

and we now choose a tap-updating algorithm that adjusts $c$ in such
a way that the MS error (3) is minimized. The gradient of $\varepsilon$ with
respect to $c$ can easily be obtained by combining (1) to (3),

$$g = \frac{1}{2}\frac{\partial \varepsilon}{\partial c} = E\{e_k x_k\} = Ac - v, \tag{4}$$

where we have introduced the signal autocorrelation matrix,

$$A = E\{x_k x_k^T\}, \tag{5}$$

and the signal correlation vector,

$$v = E\{x_k d_k\}. \tag{6}$$

The optimum tap vector in the mean-square sense that minimizes (3)
is obtained when we set the gradient to zero,

$$c_{opt} = A^{-1} v. \tag{7}$$
We propose a tap-updating algorithm of the form

$$c_{m+1} = c_m - Q_m g_m, \tag{8}$$

where $g_m$ is the gradient evaluated with tap vector $c_m$ after the $m$th
iteration, and $Q_m$ is a nonsingular matrix, which is discussed below.
Obviously, an algorithm of the general form (8) will stop updating
(turn itself off) when $c = c_{opt}$. If the algorithm converges at all, it
will thus converge to the MS solution (7). The traditional steepest-descent
gradient algorithm is obtained as a special case from (8)
if we set $Q_m = \beta I$ (or a more general diagonal matrix instead of the
unity matrix $I$ if tapering should be included).
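
As a minimal numerical sketch of update rule (8), the following Python compares the steepest-descent choice of Q (a scaled identity) with the choice Q = A^{-1}. The 2-tap matrix A, the vector v, and the step size are illustrative assumptions, not values from the paper; the helper names are hypothetical.

```python
# Sketch of the generalized update c_{m+1} = c_m - Q g_m, with g_m = A c_m - v.
# A, v and beta are illustrative assumptions (a strongly correlated 2-tap case).

def matvec(M, x):
    return [sum(M[i][j] * x[j] for j in range(len(x))) for i in range(len(M))]

def inverse_2x2(M):
    (a, b), (c, d) = M
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def iterate(A, v, Q, c0, steps):
    """Apply c <- c - Q (A c - v) for a fixed matrix Q."""
    c = list(c0)
    for _ in range(steps):
        Ac = matvec(A, c)
        g = [Ac[i] - v[i] for i in range(len(v))]   # gradient, eq. (4)
        Qg = matvec(Q, g)
        c = [c[i] - Qg[i] for i in range(len(c))]   # update, eq. (8)
    return c

def err(c):
    """Largest tap error relative to the optimum tap vector."""
    return max(abs(c[i] - c_opt[i]) for i in range(len(c_opt)))

A = [[1.0, 0.9], [0.9, 1.0]]       # eigenvalues 1.9 and 0.1: large spread
v = [1.0, 0.0]
c_opt = matvec(inverse_2x2(A), v)  # eq. (7)

beta = 1.0
c_grad = iterate(A, v, [[beta, 0.0], [0.0, beta]], [0.0, 0.0], steps=10)
c_new = iterate(A, v, inverse_2x2(A), [0.0, 0.0], steps=1)

print("steepest descent, 10 iterations:", err(c_grad))
print("Q = A^-1, 1 iteration:          ", err(c_new))
```

With these assumed numbers the gradient iteration is still far from the optimum after ten steps, while the single step with Q = A^{-1} lands on it to within rounding error.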

The algorithm (8) can be written in the form

$$c_{m+1} = (I - Q_m A)c_m + Q_m v. \tag{9}$$

If we introduce the tap error vector

$$\phi_m = c_m - c_{opt}, \tag{10}$$

it is easy to show that

$$\phi_{m+1} = \left[\prod_{n=1}^{m} (I - Q_n A)\right]\phi_1. \tag{11}$$
Obviously, the convergence can be controlled to some degree by the
selection of $Q_m$, since this choice affects the eigenvalues of $I - Q_m A$.
This is discussed in Section III.


III. SELECTION OF THE Q MATRIX


First we will make some general remarks without specifying that
the signal format be partial response. If the channel characteristics,
demodulating carrier phase, and timing instant were perfectly known,
we would select $Q = Q_1 = A^{-1}$ and then obtain $c_{opt}$ after the first
iteration. Actually, if that knowledge were available, it would, of
course, be easier to preset the taps to $c_{opt}$ or to introduce a suitable
fixed compensation network. Unfortunately, the transmission channel
is not completely known. If, however, the spread within these channels
is not extreme, we can reduce the average equalizer training time if we
select a good estimate of the optimum tap vector for initial presetting.
For example,

$$c_1 = E\{A_i^{-1} v_i\}, \tag{12}$$

where the expected value is taken over the ensemble of channels, and
a similar estimate for Q,

$$Q_m = \beta_m \left[E\{A_i\}\right]^{-1}. \tag{13}$$

For a particular channel with

$$A_i = E\{A_i\} - \Delta_i, \tag{14}$$

the critical matrix $B_i = I - Q_m A_i$ then becomes

$$B_i = (1 - \beta_m) I + \beta_m \left[E\{A_i\}\right]^{-1} \Delta_i. \tag{15}$$

If we select $\beta_m = 1$, the average $B_i$ matrix is equal to the zero matrix.
For a randomly selected channel, the $B_i$ matrix will, on the average,
be much closer to the zero matrix than with a gradient algorithm, and
faster convergence may be obtained. Modern digital signal-processing 
techniques allow quick and easy change (or selection) of (12) and (13) 
with different read-only memories at a fraction of the costs that are 
involved when traditional compromise equalizers have to be changed. 
Furthermore, even different signal formats may be processed effi- 
ciently with the same modem. 
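
The step from (13) and (14) to (15) is short. Written out as a worked restatement that uses only the definitions above (with $\Delta_i$ denoting the deviation of channel $i$ from the ensemble average, so that its expected value is zero):

```latex
\begin{align*}
B_i &= I - Q_m A_i
     = I - \beta_m \left[E\{A_i\}\right]^{-1}\left(E\{A_i\} - \Delta_i\right) \\
    &= (1-\beta_m)\, I + \beta_m \left[E\{A_i\}\right]^{-1}\Delta_i , \\
E\{B_i\} &= (1-\beta_m)\, I \qquad \text{since } E\{\Delta_i\} = 0 .
\end{align*}
```

With $\beta_m = 1$ the ensemble average of $B_i$ vanishes, which is the statement made in the text.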


IV. APPLICATIONS TO PARTIAL-RESPONSE SIGNALING 

Our results have so far been quite general, as we have not yet 
defined a special signaling technique. As was already mentioned in 
Section I, we see a particularly promising application to processing zero
excess-bandwidth partial-response signals. For such signals, the A
matrix depends only on the amplitude characteristics of the channel
and is independent of the phase parameters® (timing phase, carrier 
phase, and phase characteristic of the channel). Since the amplitude 
response of voice-grade channels is relatively constant over the trans- 
mission band (or at least under much better control than the phase 
response), we may select an estimate 


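$$Q_m = \beta_m A_0^{-1}, \tag{16}$$

where $A_0$ is the ideal partial-response signal matrix. More specifically,
for Class IV signaling, the matrix elements of $A_0$ are

$$[A_0]_{ik} = \delta_{ik} - \tfrac{1}{2}\delta_{i,k+2} - \tfrac{1}{2}\delta_{i,k-2} \tag{17}$$

and for duobinary are

$$[A_0]_{ik} = \delta_{ik} + \tfrac{1}{2}\delta_{i,k+1} + \tfrac{1}{2}\delta_{i,k-1}. \tag{18}$$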
The inverse of both (nonsingular) matrices may be expressed as

$$A_0^{-1} = (I - G)^{-1} = I + G + G^2 + \cdots, \tag{19}$$

which shows that the elements of $A_0^{-1}$ are rational numbers, and are
symmetric with respect to both matrix diagonals.

Having selected an ideal channel as an estimate for Q, our algorithm
takes the form

$$\phi_{m+1} = (I - \beta_m A_0^{-1} A)\phi_m = B_m \phi_m. \tag{20}$$

Obviously, if we have only phase distortion, then $A = A_0$ and we
may select $\beta_m = 1$. The equalizer will then reach its optimum tap
vector within a single iteration because the matrix $B_m$ is zero.

If the channel has amplitude distortion, then $A \neq A_0$ and thus
$B_m \neq 0$. The eigenvalues $\lambda_i$ and eigenvectors $w_i$ of $A_0^{-1}A$ are defined by

$$A_0^{-1} A w_i = \lambda_i w_i. \tag{21}$$


Following a procedure shown by Chang,* we premultiply both sides by 
$w_i^T A_0$ and obtain

$$\lambda_i = \frac{w_i^T A w_i}{w_i^T A_0 w_i}. \tag{22}$$





Since both $A_0$ and $A$ are symmetric, positive-definite matrices, we
conclude that all $\lambda_i$ are positive. Chang® has further shown that the
eigenvalues can be bounded by

$$|H(\omega)|^2_{\min} \le \lambda_i \le |H(\omega)|^2_{\max}, \tag{23}$$

where $H(\omega)$ is the quotient between the actual and the estimated
channel transfer function. For convergence, the spectral radius $\rho$ of
$B_m$ must be less than unity; this is guaranteed if we choose

$$0 < \beta_m < \frac{2}{\lambda_{\max}}, \tag{24}$$

since the eigenvalues $\mu_i$ of $B_m$ are given by

$$\mu_i = 1 - \beta_m \lambda_i. \tag{25}$$

If the amplitude distortion is small, $A_0^{-1}A$ is close to the identity
matrix, and the eigenvalues $\lambda_i$ will be closely scattered around unity.
We would then select $\beta_m = \text{const} \approx 1$, and the matrix $B$ would be
close to the zero matrix with all eigenvalues $\mu_i$ close to zero. The tap
error vector after $m$ iterations is given, from (20), by

$$\phi_m = (I - \beta A_0^{-1} A)^m \phi_0 = B^m \phi_0 \tag{26}$$


if $\beta$ is constant (this is, of course, replaced by the usual matrix product
if $\beta = \beta_m$). The squared magnitude of $\phi_m$ and the excess MS error are
given by the expressions

$$|\phi_m|^2 = \phi_0^T (B^m)^T B^m \phi_0 \tag{27}$$

$$\varepsilon_m = \phi_0^T (B^m)^T A B^m \phi_0, \tag{28}$$


and can be used to estimate the convergence speed. The procedure 
and the results are practically identical to those of Chang® and are, 
therefore, not repeated here. 
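
To make the convergence condition (24) concrete, here is a small sketch for a hypothetical two-tap duobinary case. The off-diagonal value 0.3 assumed for the distorted matrix A and the starting error vector are illustrative choices, not values from the paper. For 2x2 matrices of this symmetric form the eigenvalues of $A_0^{-1}A$ follow in closed form, because $A_0$ and $A$ share the eigenvectors (1, 1) and (1, -1).

```python
# Illustrative check of eqs. (24) and (25) for N = 2 duobinary signaling.
# r0 = 0.5 follows from eq. (18); r = 0.3 is an assumed distorted channel.

r0, r = 0.5, 0.3

# A0 = [[1, r0], [r0, 1]] and A = [[1, r], [r, 1]] share the eigenvectors
# (1, 1) and (1, -1), so the eigenvalues of A0^-1 A are simple ratios.
lambdas = [(1 + r) / (1 + r0), (1 - r) / (1 - r0)]
beta_limit = 2.0 / max(lambdas)               # upper bound in eq. (24)

def spectral_radius(beta):
    """Largest |mu_i| with mu_i = 1 - beta * lambda_i, eq. (25)."""
    return max(abs(1.0 - beta * lam) for lam in lambdas)

def error_norm_after(beta, m, phi0=(1.0, 0.3)):
    """|phi_m| from eq. (26), resolved along the two shared eigenvectors."""
    s = (phi0[0] + phi0[1]) / 2.0             # component along (1, 1)
    d = (phi0[0] - phi0[1]) / 2.0             # component along (1, -1)
    s *= (1.0 - beta * lambdas[0]) ** m
    d *= (1.0 - beta * lambdas[1]) ** m
    return ((s + d) ** 2 + (s - d) ** 2) ** 0.5

for beta in (0.5, 1.0, 0.99 * beta_limit, 1.01 * beta_limit):
    print(f"beta={beta:.3f}  rho={spectral_radius(beta):.3f}  "
          f"|phi_20|={error_norm_after(beta, 20):.3e}")
```

In this assumed example the error shrinks for every step size below the limit 2/λ_max and grows once the limit is exceeded, as (24) and (25) predict.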


V. IMPLEMENTATION 


Figure 1 is a basic block diagram of the new equalizer. A version with 
only three taps is shown for clarity, although a much larger and prefer- 
ably even number of taps would actually be used. The upper part 
represents a traditional MS equalizer with the well-known circuitry
for correlating the error signal with each tap signal to obtain the 
gradient g = Ac — v. Because of the particular structure of A for 
partial response signals, the components of g are highly correlated. 
The tap corrections obtained with a direct gradient algorithm would 
thus not be independent of each other. This interaction (which would 
cause a very slow convergence) can be eliminated if the gradient is 
“decorrelated” in the fixed weighting matrix Q shown in the lower
part of Fig. 1. The resulting outputs are now decoupled and can be 
used to adjust the taps free of interaction to obtain rapid convergence. 

A somewhat different arrangement is shown in Fig. 2. Here the 
(same) Q matrix is placed between the tap signals and the correlators. 
It is obvious that this arrangement is equivalent to Fig. 1 because the 
error signal is common to all correlator inputs. 
Fig. 2—Use of Q matrix in a second implementation of the new algorithm.

Fig. 3—Use of P matrix to obtain a modified equalizer structure.


As a comparison, we show in Fig. 3 a three-tap equalizer proposed 
by Chang.® Here the input signal is first transformed by a weighting 
matrix P into a set of orthonormal components that are then processed 
as in a conventional equalizer. It is, in fact, identical to a conventional 
equalizer except that the input signal vector x is replaced by Px, 
where P is a suitably chosen matrix such that $P^T P = A^{-1}$. Initial
presetting of the equalizer and constraining the range of the variable 
gain coefficients can, however, be quite complicated with this structure. 
These difficulties are avoided in the new proposal, since the equalizer 
structure remains unchanged and only the updating algorithm is 
modified. 

Although the number of multiplications is the same in all three 
structures (Figs. 1 to 3), we would like to point out one very important 
difference: in Chang’s structure, the accuracy of the transformation 
is very critical. The input signals must be represented by their full 
precision (say, 10 bits). To avoid round-off errors when all the products 
are summed up, more than 12 accurate bits of these products must be 
processed and can then (after summation) probably be rounded off 
to 10 bits for the transformed tap signal. Any errors introduced in this 
process will result in undesirable output noise that cannot be compen- 
sated for by the equalizer. In the new equalizer, however, only the tap 
increments are affected by errors in the matrix calculation. Obviously, 
a much lower accuracy is needed; in the extreme case, the correct
polarity will be sufficient. Even with comparatively large errors, the 
tap-vector noise can be kept low by suitable scaling of the increments. 
We thus conclude that the new equalizer has inherent advantages as 
far as complexity, cost, size, speed requirements, and time-sharing 
possibilities are concerned. Furthermore, simpler algorithms can be 
derived from this one. If, for example, the structure of Fig. 2 is used 
for an algorithm with tap signals quantized to one bit, the matrix 
operation is reduced to controlling the addition and subtraction of 
the coefficients and may, for a medium-sized equalizer, be cheaply and
efficiently replaced by a suitably programmed read-only memory.
For such nonmean-square algorithms, the optimum Q matrix will, in
general, of course, be different from $A_0^{-1}$.

A few words should be said about the structure of the matrix 
$Q = A_0^{-1}$ in the special case of partial-response Class IV signaling. It
can be shown that every second diagonal array is zero; furthermore, 
if N is even, every second row (or column) is a shifted version of the 
previous one. These properties are identical to those of Chang’s P 
matrix (the remaining nonzero coefficients are, however, different from 
those of the P matrix) and can be used to simplify the signal-processing 
operations. 

Finally, we would like to point out that an all-digital implementation 
for higher data speeds (group band) would still be extremely complex 
and expensive. For such applications, an array of operational amplifiers 
with suitable weighting resistors (probably mounted on a ceramic) 
would be a far more economic solution with the present state of the art. 


VI. SIMULATIONS 


To test the proposals made in this paper, a 15-tap MS equalizer was
simulated. A 4800-b/s, SSB-modulated, partial-response, Class IV
signal was selected. A parabolic delay characteristic was assumed
with 1-ms delay difference between the passband channel center and
edges. Table I shows the coefficients of the matrix $Q = A_0^{-1}$. In Fig. 4,
the equalizer convergence is shown for four different timing phases
that are separated by 90 degrees each. After each iteration, the MS
distortion of the overall impulse response is plotted. It is seen that the
minimum is basically achieved with the first iteration (the small
residual error after the first iteration results from the time-domain
truncation and round-off errors in the components of the initial impulse
response during the simulation). Some additional simulations were
made with an MS stochastic approximation algorithm (adjustments at
the symbol rate) and with amplitude distortion; they also showed a
clearly improved convergence when the decorrelation matrix was used.
Table I — Decorrelation matrix for N = 15
(elements multiplied by 36)

  64   0  56   0  48   0  40   0  32   0  24   0  16   0   8
   0  63   0  54   0  45   0  36   0  27   0  18   0   9   0
  56   0 112   0  96   0  80   0  64   0  48   0  32   0  16
   0  54   0 108   0  90   0  72   0  54   0  36   0  18   0
  48   0  96   0 144   0 120   0  96   0  72   0  48   0  24
   0  45   0  90   0 135   0 108   0  81   0  54   0  27   0
  40   0  80   0 120   0 160   0 128   0  96   0  64   0  32
   0  36   0  72   0 108   0 144   0 108   0  72   0  36   0
  32   0  64   0  96   0 128   0 160   0 120   0  80   0  40
   0  27   0  54   0  81   0 108   0 135   0  90   0  45   0
  24   0  48   0  72   0  96   0 120   0 144   0  96   0  48
   0  18   0  36   0  54   0  72   0  90   0 108   0  54   0
  16   0  32   0  48   0  64   0  80   0  96   0 112   0  56
   0   9   0  18   0  27   0  36   0  45   0  54   0  63   0
   8   0  16   0  24   0  32   0  40   0  48   0  56   0  64
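
The entries of Table I can be regenerated from the Class IV matrix of (17). The sketch below is one way to do this with exact rational arithmetic; the function names are hypothetical, and Gauss-Jordan elimination is simply a convenient choice here, not a method described in the paper.

```python
from fractions import Fraction

def class4_matrix(n):
    """Ideal Class IV matrix A0 of eq. (17): 1 on the diagonal, -1/2 at |i-k| = 2."""
    return [[Fraction(1) if i == k else
             Fraction(-1, 2) if abs(i - k) == 2 else Fraction(0)
             for k in range(n)] for i in range(n)]

def invert(M):
    """Exact Gauss-Jordan inversion over the rationals."""
    n = len(M)
    aug = [row[:] + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(M)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [x / p for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                f = aug[r][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

N = 15
Q36 = [[36 * x for x in row] for row in invert(class4_matrix(N))]
for row in Q36:
    print(" ".join(f"{int(x):4d}" for x in row))
```

Because the arithmetic is exact, the output also shows directly that every second diagonal is zero and that the matrix is symmetric about both diagonals.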
Fig. 4—Convergence of new algorithm for different timing phases.

VII. CONCLUSIONS


The spread of the eigenvalues of the signal autocorrelation matrix 
can become very large for signals with nonflat amplitude spectra. If 
the gradient with respect to the tap gains of an MS equalizer is calculated,
its components are highly correlated. The resulting interaction
of the individual gain adjustments may cause a very slow equalizer
convergence. We have proposed a new MS algorithm that eliminates
these problems. With this new algorithm, a set of uncorrelated tap 
increments is calculated so that the individual adjustments are de- 
coupled from each other. This can be achieved in either of two ways: 
The gradient can first be calculated in the conventional way (corre- 
lation of error signal and tap signals) and then be multiplied by a 
decorrelation matrix Q to produce the final correction vector. On the 
other hand, tap signals can first be passed through such a matrix and 
the resulting outputs can then be correlated with the error signal. The 
matrix Q is chosen as the inverse of the expected (average) signal 
autocorrelation matrix A. The new algorithm was simulated for partial 
response signals with zero excess bandwidth. In the case of a flat 
channel, the optimum tap coefficients can then be determined within 
a single iteration, independent of delay distortion, carrier phase, and 
timing instant. This was confirmed by simulation. Compared with 
other recent proposals for fast start-up, the new algorithm features 
simpler presetting and requires less accuracy in the matrix trans- 
formation. Furthermore, this method can easily be extended to systems 
with nonmean-square cost functions. 
Numerical Evaluation of Integrals with Infinite
Limits and Oscillating Integrands 


By S. O. RICE 
(Manuscript received March 15, 1974) 


The numerical evaluation of slowly convergent integrals with infinite 
limits and regularly oscillating analytic integrands is discussed. Functions 
are presented that can be subtracted from the integrands to increase the 
rate of convergence. One of the integrals used as an example is an often- 
studied Fourier integral related to the power spectrum of a phase-modu- 
lated wave. It is pointed out that, when Fourier integrals whose values are 
known to be positive, e.g., probability densities, are evaluated by the 
trapezoidal rule, the values given by the trapezoidal rule are never less than 
the true values. 


I. INTRODUCTION 


This paper is in the nature of an addendum to an earlier paper in 
this journal dealing with the numerical evaluation of definite integrals.¹

Integrals with infinite limits and oscillating integrands often arise 
in technical problems. In this paper, we assume that the integrand is 
an analytic function of the variable of integration u (in a suitable 
region) and tends to oscillate at a regular rate as u → ∞. The main
concern here is how to deal with cases in which the rate of convergence 
is slow. 

A number of ways have been proposed to handle the slow convergence.
One is to use integration formulas of Filon’s type. In particular,
E. O. Tuck² has given a formula of this type suited to an infinite
range of integration. Another method is to evaluate the contributions 
of positive and negative loops of the integrand and apply Euler’s 
summation formula for slowly converging alternating series. Still 
another is to (i) write the integrand in terms of complex exponentials,
(ii) deform the path of integration so that it becomes a path (or
portions of paths) of steepest descent in the complex u-plane, and then
(iii) integrate along straight-line segments that approximate the path.
A closely related procedure is to tilt the path of integration, if possible, 
so that the integrand will decrease exponentially. 
In this paper most of our attention will be devoted to still another
method of improving the convergence, namely, that of subtracting 
from the integrand functions that behave like the leading terms in the 
asymptotic expansion of the integrand. This method is quite old. It is 
related to a method used by Kummer to deal with series, and its 
application to integrals is described by Krylov. 

Sections II through V are concerned with functions useful for 
subtraction when the limits of integration are 0 and ∞. Section VI
deals with the evaluation of

$$J(b, a) = e^{-b}\int_{-\infty}^{\infty}\left[\exp(b u^{-1}\sin u) - 1 - b u^{-1}\sin u\right] e^{iau}\,du, \tag{1}$$

which serves as a typical example of a slowly converging Fourier
integral with limits ±∞.

The integral (1) has been the subject of several papers because of its 
relation to the problem of calculating the power spectrum of a
phase-modulated wave (see Prabhu and Rowe⁴ and the references they
give). In Section VI, results of trial calculations are presented that
show that (1) can be efficiently evaluated by the trapezoidal rule after
suitable functions have been subtracted from the integrand.

Sections VII and VIII are concerned with the error introduced when
a general Fourier integral with limits ±∞ is evaluated by the
trapezoidal rule.


II. PROPERTIES OF FUNCTIONS SUITABLE FOR SUBTRACTION—
LIMITS 0 AND ∞


Let the integral to be evaluated be

$$I = \int_0^\infty f(u)\,du,$$

and let the leading term in the asymptotic expansion of f(u) for large
u be $u^{-\nu}\exp(iau)$, where a is real and ν > 0. Quite often, in performing
numerical integrations, this leading term is subtracted from the integrand
only for values of u that exceed some large value U. We have a
slightly different procedure in mind here.

Here we want to use the trapezoidal rule methods described in Ref. 1.
Since these methods require the integrand to be analytic, we assume
that f(u) is analytic in a region containing the positive real axis. Our
aim is to subtract an analytic function (which approaches $u^{-\nu}\exp(iau)$
as u → ∞) from f(u) over the entire interval (0, ∞). When this is done,
the resulting integral has an integrand analytic over (0, ∞) and can be
evaluated by the trapezoidal rule methods described in Ref. 1 (see
especially Section IX).
Functions suitable for subtraction when the limits are 0 and ∞
should have the following properties:

(i) As u → ∞, they should be asymptotic to $u^{-\nu}\cos au$ or $u^{-\nu}\sin au$
or, more generally, to

$$u^{-\nu}\exp(iau), \tag{2}$$

where ν > 0 and a is real.

(ii) They should be easy to compute and they should remain finite
at u = 0.

(iii) Their integrals between the limits (0, ∞) should be easy to
compute.

In some cases, moderately simple functions can be used for subtraction.
If the leading term in the asymptotic expansion of the
integrand is (sin au)/u, then we can subtract and add the integral

$$\int_0^\infty u^{-1}\sin au\,du = \frac{\pi a}{2|a|}. \tag{3}$$

Similarly, for (cos au)/u we can use

$$\int_0^\infty u^{-1}\left[\cos au - \exp(-au)\right]du = 0, \tag{4}$$

and, for (cos au)/u²,

$$\int_0^\infty \frac{(\cos au)\,du}{1 + u^2} = \tfrac{1}{2}\pi\exp(-|a|). \tag{5}$$


The last integral (5) has the disadvantage that its integrand has the
asymptotic expansion

$$\frac{\cos au}{1 + u^2} = \frac{\cos au}{u^2} - \frac{\cos au}{u^4} + \cdots \tag{6}$$

and if we desire to subtract terms of both $O(u^{-2})$ and $O(u^{-4})$ from the
original integrand, the term $-(\cos au)/u^4$ in (6) has to be taken into
account. In the applications we have in mind, the exp(−au) in (4)
decreases so rapidly that it does not have to be taken into account
in the way that $-(\cos au)/u^4$ does.


III. GENERAL FUNCTIONS FOR SUBTRACTION—LIMITS 0 AND ∞


Here we present a set of functions that meets the “ease of computation”
requirements (ii) and (iii) reasonably well. It is quite likely
that there are better sets, but this one is the best I have been able to
find. There are two formulas according to whether ν is an integer or not.
The functions are the integrands in

$$\int_0^\infty u^{-n-\epsilon}\left[e^{-\alpha u} - e^{-\beta u}\right]^n du = \frac{(-1)^n\,\Gamma(1-\epsilon)}{(\epsilon)_n}\sum_{k=0}^{n}(-1)^k\binom{n}{k}\left[(n-k)\alpha + k\beta\right]^{n+\epsilon-1}, \tag{7}$$

$$\int_0^\infty u^{-n}\left[e^{-\alpha u} - e^{-\beta u}\right]^n du = \frac{(-1)^n}{(n-1)!}\sum_{k=0}^{n}(-1)^k\binom{n}{k}\left[(n-k)\alpha + k\beta\right]^{n-1}\ln\left[(n-k)\alpha + k\beta\right], \tag{8}$$

where n is a positive integer and n, α, β, and ε are such that the integrals
converge. In (7), ε is not an integer, and $(\epsilon)_0 = 1$, $(\epsilon)_n = \epsilon(\epsilon + 1)\cdots(\epsilon + n - 1)$.
In using (7) and (8), we set α = −ia/n, β = |a|/n.
In (7), n and ε are chosen so that ε < −1 and n + ε is equal to the
exponent ν in (2). The choice ε < −1 is made to make the integrand
and its derivative in (7) approach 0 as u → 0. This tends to reduce the
error in the numerical integration. Since a can be either positive or
negative in α = −ia/n and β = |a|/n, we take

$$-\pi/2 \le \arg\left[(n-k)\alpha + k\beta\right] \le \pi/2$$

when calculating the logarithm and the (n + ε − 1)th power of
[(n − k)α + kβ].

Equation (8) is obtained by letting ε → 0 in (7), and (7) is based
upon (page 243 of Ref. 5)

$$\Gamma(1 - n - \epsilon) = \int_0^\infty t^{-n-\epsilon}\left[e^{-t} - \sum_{l=0}^{n-1}\frac{(-t)^l}{l!}\right]dt, \tag{9}$$

where, in contrast to (7), ε is restricted to 0 < ε < 1 to secure
convergence. To obtain (7), expand $[\exp(-\alpha u) - \exp(-\beta u)]^n$ by the
binomial theorem. Then to each term $\exp[-(n-k)\alpha u - k\beta u]$, add
and subtract the series

$$\sum_{l=0}^{n-1} (-u)^l\left[(n-k)\alpha + k\beta\right]^l / l!.$$

Equation (7) then follows from (9) with $t = [(n-k)\alpha + k\beta]u$ and
the relation

$$\sum_{k=0}^{n}(-1)^k\binom{n}{k}\left[(n-k)\alpha + k\beta\right]^l = 0, \tag{10}$$

where l = 0, 1, ···, n − 1. Equation (10) expresses the fact that $u^n$ is the
lowest power of u in the power series for $[\exp(-\alpha u) - \exp(-\beta u)]^n$.
After (7) has been established for 0 < ε < 1, it can be extended by
analytic continuation to the values of ε needed in the application of (7).

Other choices of α and β besides −ia/n and |a|/n can be made in
(7) and (8). In some cases, it may be better to take α = −ia/n and
β = α + b, where b is a positive constant at our disposal. For this
choice of α and β, $[\exp(-\alpha u) - \exp(-\beta u)]^n$ becomes exp(iau)
× $[1 - \exp(-bu)]^n$ and (n − k)α + kβ becomes kb − ia. By using
these values in (7), equating imaginary parts, and letting ε → 1, we
can obtain

$$\int_0^\infty u^{-n-1}\left(1-e^{-bu}\right)^n \sin au\,du = \operatorname{Im}\left\{\frac{(-1)^n}{n!}\sum_{k=0}^{n}(-1)^k\binom{n}{k}(kb+ia)^n\ln(kb+ia)\right\}.$$

Here the integrand tends to $(\sin au)/u^{n+1}$ as u → ∞.
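
The closed-form sums on the right of (7) and (8) are easy to evaluate in complex arithmetic. The following is a minimal sketch, assuming the principal branch of the logarithm and power (which satisfies the stated condition on the argument whenever the real part of (n − k)α + kβ is nonnegative). The function names `rising_factorial`, `rhs_eq7`, and `rhs_eq8` are illustrative and do not come from the paper.

```python
import cmath
from math import comb, factorial, gamma


def rising_factorial(eps, n):
    """Pochhammer symbol (eps)_n = eps (eps + 1) ... (eps + n - 1), with (eps)_0 = 1."""
    product = 1.0
    for j in range(n):
        product *= eps + j
    return product


def rhs_eq7(n, eps, alpha, beta):
    """Right-hand side of (7) for non-integer eps (principal branch of the power)."""
    total = 0j
    for k in range(n + 1):
        c = complex((n - k) * alpha + k * beta)
        total += (-1) ** k * comb(n, k) * c ** (n + eps - 1)
    return (-1) ** n * gamma(1.0 - eps) / rising_factorial(eps, n) * total


def rhs_eq8(n, alpha, beta):
    """Right-hand side of (8) (principal branch of the logarithm)."""
    total = 0j
    for k in range(n + 1):
        c = complex((n - k) * alpha + k * beta)
        total += (-1) ** k * comb(n, k) * c ** (n - 1) * cmath.log(c)
    return (-1) ** n / factorial(n - 1) * total


# n = 1, alpha = -i, beta = 1 gives -ln(-i) + ln(1) = i*pi/2.
print(rhs_eq8(1, -1j, 1.0))
# Real alpha = 1, beta = 2 reproduces the Frullani integral, ln 2.
print(rhs_eq8(1, 1.0, 2.0))
```

With real positive α and β, the n = 1 case of (8) reduces to the classical Frullani integral, which gives a quick independent check of the signs.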


IV. EXAMPLE ILLUSTRATING APPLICATION OF EQ. (8) 


Consider the integral

$$I = \int_0^\infty (1+u^2)^{-1/2}e^{iu}\,du, \tag{11}$$

which is equal to $K_0(1) + i\pi[I_0(1) - L_0(1)]/2$, where $K_0$ and $I_0$ are
Bessel functions and $L_0$ is a Struve function (page 498 of Ref. 6). As
u → ∞, the integrand approaches

$$u^{-1}e^{iu} - \tfrac{1}{2}u^{-3}e^{iu} + \cdots.$$

Therefore, if we desire convergence like that of $u^{-5}e^{iu}$, we apply (8)
twice. First with n = 1, α = −i, β = 1, and then with n = 3,
α = −i/3, β = 1/3:

$$\begin{aligned}
I = {} & \int_0^\infty du\left[(1+u^2)^{-1/2}e^{iu} - \frac{e^{iu}-e^{-u}}{u} + \frac{1}{2}\left(\frac{e^{iu/3}-e^{-u/3}}{u}\right)^3\right] \\
& + \left[-\ln(-i)+\ln(1)\right] \\
& + \frac{1}{2}\cdot\frac{1}{2!}\sum_{k=0}^{3}(-1)^k\binom{3}{k}\left[(3-k)(-i/3)+k/3\right]^2 \ln\left[(3-k)(-i/3)+k/3\right].
\end{aligned} \tag{12}$$

The second line reduces to iπ/2 [which, incidentally, gives the
combination of (3) and (4)]. Although the last sum appears complicated,
it can be readily evaluated by a digital computer when the program
is written using complex variables.

The integrand in (12) now decreases as $u^{-5}e^{iu}$ when u is large, and
the integral can be evaluated by any suitable method of numerical
integration. Some computations were made using the trapezoidal
rule as described in Section IX of Ref. 1. In this method, the variable of
integration is changed from u to x, where u = a ln[1 + exp(x/a)],
and the integral in x is evaluated by the trapezoidal rule. A computation
using 52 points with h = Δx = 4.0 and a = 6 gave I = 0.42102···
+ i0.87308···, which agrees with tabulated values.

This example brings to mind the question, when does one cease
subtracting terms from the integrand and start integrating? That is, 
what is the trade-off between computing trapezoidal sums and those 
sums in (7) and (8)? There seems to be no well-defined answer, but 
experience indicates that there is usually no need to make the integrand 
decrease faster than the inverse fifth or sixth power of the variable of 
integration. 

Incidentally, the integral for I is quite readily evaluated by tilting
the path of integration. Arbitrarily choosing a 45-degree tilt and setting
u = (1 + i)t carries (11) into

$$I = \int_0^\infty e^{-t}f(t)\,dt, \qquad f(t) = (1+i)\left(1+2it^2\right)^{-1/2}e^{it}. \tag{13}$$

The contribution of the 45-degree arc at |u| = ∞ is zero by Jordan's
lemma (page 115 of Ref. 5). The integral is now in the form discussed in
Section VI of Ref. 1, and can be efficiently evaluated by the trapezoidal
rule after the change of variable $t = e^v$, $v = x - e^{-x}$. Computations
made with this example suggest that when "tilting" can be used, it is
preferable to "subtraction." However, tilting cannot always be used,
an example being the integral J(b, a) defined by (1).
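
The tilted-path form (13) can be tried out with a few lines of code. The sketch below is an illustration under stated assumptions, not the program used for the paper: the step h = 0.1 and the truncation range −4 ≤ x ≤ 6 are choices made here, and the function names are hypothetical. It applies the trapezoidal rule in x after the change of variable $t = e^v$, $v = x - e^{-x}$.

```python
import cmath
import math


def tilted_integrand(t):
    """e^{-t} f(t) from (13), with f(t) = (1 + i)(1 + 2 i t^2)^(-1/2) e^{it}."""
    return math.exp(-t) * (1 + 1j) * (1 + 2j * t * t) ** -0.5 * cmath.exp(1j * t)


def integrate_tilted(h=0.1, x_min=-4.0, x_max=6.0):
    """Trapezoidal rule in x after t = e^v, v = x - e^{-x}.

    The integrand is negligible at both ends of the range, so the usual
    half weights at the end points are omitted.
    """
    n_steps = round((x_max - x_min) / h)
    total = 0j
    for j in range(n_steps + 1):
        x = x_min + j * h
        v = x - math.exp(-x)
        t = math.exp(v)
        dt_dx = t * (1.0 + math.exp(-x))  # dt/dx = e^v (1 + e^{-x})
        total += tilted_integrand(t) * dt_dx
    return h * total


print(integrate_tilted())  # compare with I = 0.42102... + i0.87308...
```

The principal branch of the square root is adequate here because the argument of $1 + 2it^2$ stays between 0 and π/2 along the path.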


V. EXAMPLE ILLUSTRATING THE USE OF EQ. (7) 
Consider the integral

$$I = \int_0^\infty (1+u^2)^{-5/6}e^{iu}\,du, \tag{14}$$

the real part of which is related to the Bessel function $K_{1/3}(1)$. When
u → ∞, the integrand becomes

$$u^{-5/3}e^{iu} - \tfrac{5}{6}u^{-11/3}e^{iu} + \cdots.$$

Therefore, to secure convergence of $O(u^{-11/3})$, we apply (7) once with
n + ε = 5/3. Since n must be a positive integer, possible choices of ε
are 2/3, −1/3, −4/3, ···. We also want ε < −1. We arbitrarily
choose ε = −4/3 (and therefore n = 3) to make the integrand of (7)
tend to 0 as $u^{4/3}$ when u → 0. Because n = 3 we take α = −i/3 and
β = 1/3. Then

$$\begin{aligned}
I = {} & \int_0^\infty du\left\{(1+u^2)^{-5/6}e^{iu} - u^{-5/3}\left(e^{iu/3}-e^{-u/3}\right)^3\right\} \\
& + \frac{\Gamma(7/3)}{(4/3)(-1/3)(2/3)}\sum_{k=0}^{3}(-1)^k\binom{3}{k}\left[(3-k)(-i/3)+(k/3)\right]^{2/3}.
\end{aligned} \tag{15}$$

This integral can be evaluated by the trapezoidal rule in much the
same way as was the integral in (12).


VI. NUMERICAL EVALUATION OF THE INTEGRAL J(b,a) 


To simplify the discussion of J(b, a) defined by the integral (1), we 
assume that b is positive. The parameter a is real and J(b, a) is an 
even function of a. 

The integrand in J(b, a) can be expanded as exp(iau) times

$$\sum_{n=2}^{\infty}\frac{(bu^{-1}\sin u)^n}{n!},$$

and the term for n = 2 shows that the integral converges slowly.
Inspection shows that the rate of convergence can be increased by
writing J(b, a) as an integral from 0 to ∞ and subtracting terms of
the form $u^{-n}\exp(icu)$ as described in Sections III and IV. However,
it is more convenient to subtract terms of the form $(u^{-1}\sin u)^n$ and
use the known values of

$$B_n = \int_{-\infty}^{\infty}\frac{(bu^{-1}\sin u)^n}{n!}\cos au\,du, \tag{16}$$

where n ≥ 1 is an integer and $B_n$ is an even function of a.

When n = 1, $B_1$ is equal to bπ for |a| < 1, to bπ/2 for |a| = 1,
and to 0 for |a| > 1. When n ≥ 2, $B_n$ is a continuous function of a
which is 0 when |a| ≥ n. When n ≥ 2 and |a| ≤ n,

$$B_n = \frac{\pi b^n}{2^{n-1}\,n!\,(n-1)!}\sum_{m=0}^{M}(-1)^m\binom{n}{m}(n \pm a - 2m)^{n-1}, \tag{17}$$

where M is the integer part of (n ± a)/2. The choice of the sign in ±
is arbitrary, but it must be the same in M and in (17). The minus sign
is the more convenient choice when a > 0.

When we desire convergence at the rate of, say, $1/u^6$, we subtract
and add $e^{-b}(B_2 + B_3 + B_4 + B_5)$:

$$J(b,a) = e^{-b}\int_{-\infty}^{\infty}\left[\exp(bu^{-1}\sin u) - 1 - \sum_{n=1}^{5}\frac{(bu^{-1}\sin u)^n}{n!}\right]e^{iau}\,du + e^{-b}(B_2 + B_3 + B_4 + B_5). \tag{18}$$

Incidentally, J(b, a) is equal to $e^{-b}$ times the infinite sum
$B_2 + B_3 + \cdots$, a result useful for computation when b is small.
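
The series remark above can be checked directly. The sketch below assumes that J(b, a) equals $e^{-b}(B_2 + B_3 + \cdots)$ and uses the closed form (17) with the minus sign. The names `B` and `J_series`, the cutoff `n_max`, and the use of exact rational arithmetic for the alternating sum are choices made for this illustration.

```python
from fractions import Fraction
from math import comb, exp, factorial, floor, pi


def B(n, b, a):
    """B_n of (16) for n >= 2, evaluated from (17) with the minus sign."""
    a = abs(Fraction(a))  # B_n is even in a
    if a >= n:
        return 0.0
    M = floor((n - a) / 2)
    # Exact rational arithmetic avoids cancellation in the alternating sum.
    s = sum((-1) ** m * comb(n, m) * (n - a - 2 * m) ** (n - 1) for m in range(M + 1))
    term = Fraction(b) ** n * s / (2 ** (n - 1) * factorial(n) * factorial(n - 1))
    return pi * float(term)


def J_series(b, a, n_max=60):
    """e^{-b} (B_2 + B_3 + ... + B_{n_max}); intended for small b."""
    return exp(-b) * sum(B(n, b, a) for n in range(2, n_max + 1))


print(J_series(1, 1))  # Table I lists 0.413 5433 for b = 1, a = 1
print(J_series(1, 4))  # Table I lists 0.000 0428 for b = 1, a = 4
```

For b = 1 the terms fall off roughly like 1/n!, so a few dozen terms are far more than needed; for large b the direct integration of (18) is the better route.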

The integrand in (18) is an analytic function of u, and the results
of Ref. 1 suggest direct numerical integration by the trapezoidal rule.
Several trial computations were made to obtain an idea of the performance
of this method of integration. In the actual work, the integral
was rewritten as an integral from 0 to ∞, and exp(iau) was replaced
by cos au. Typical results are shown in Table I. Here N is the number
of terms in the trapezoidal sum, and h is the spacing between successive
values of u. The trapezoidal sum was truncated at $u = u_{\max} = (N - 1)h$.
I believe the values shown for J(b, a) are accurate to
the number of places shown. A short table of J(b, a) for a ≤ 1.25 is
given in Ref. 7, where $J(b, a)e^{b}$ is referred to as "Lewin's integral."
When b is large, the work of Prabhu and Rowe? gives tight bounds on
J(b, a) for all values of a.

Table I. Values of J(b, a) obtained by the trapezoidal rule

  b     a     h       N    u_max    J(b, a)
  1     1    0.7     12     7.7    0.413 5433
  1     4    0.4     20     7.6    0.000 0428
  4     1    0.5     36    17.5    1.341 1671
  4     4    0.4     45    17.6    0.011 6253
 16     1    0.3     39    11.4    0.997 3179
 16    10    0.225   52    11.5    0.000 2046
 32     1    0.25    19     4.5    0.736 6452
 32    10    0.175   26     4.4    0.007 6251

The truncation value $u_{\max}$ was chosen to be the value of u beyond
which the absolute value of the integrand remains less than $10^{-8}$.
This makes the truncation error less than $10^{-8}u_{\max}/5$. It can be shown
that $u_{\max}$ has a maximum value of about 18 near b = 6.

The error E in the computed value of J(b, a) resulting from the
finite size of the spacing h, i.e., the "trapezoidal error," is of the order
of $J(b, 2\pi h^{-1} - a)$ when h is small. This follows from Section VII
below. The trapezoidal errors observed in the construction of Table I
agree well with the values of $J(b, 2\pi h^{-1} - a)$ estimated from bounds
for J(b, a) given by Prabhu and Rowe.‘ E decreases rapidly as h
decreases through the values shown in Table I.


VII. TRAPEZOIDAL ERROR FOR FOURIER INTEGRALS


Let the Fourier integral and the associated trapezoidal error be,
respectively,

$$I(a) = \int_{-\infty}^{\infty}F(u)e^{iau}\,du, \tag{19}$$

$$E = h\sum_{n=-\infty}^{\infty}F(nh)\exp(ianh) - I(a). \tag{20}$$

As in Ref. 1, Poisson's summation formula gives

$$E = \left(\sum_{k=-\infty}^{-1}+\sum_{k=1}^{\infty}\right)I(2\pi kh^{-1}+a). \tag{21}$$

When F(u) is an even function of u, I(−a) is equal to I(a), and
(21) gives

$$E = \sum_{k=1}^{\infty}\left[I(2\pi kh^{-1}-a)+I(2\pi kh^{-1}+a)\right]. \tag{22}$$

It is interesting to note that, if I(a) is known to be real and
nonnegative for all real values of a, then E is nonnegative and the values
of I(a) obtained by evaluating (19) by the trapezoidal rule are always
greater than or equal to the true value of I(a), truncation errors aside.
This fact is of use in dealing with power spectra and probability
densities.

Now let F(u) be analytic in a strip in the u-plane containing the
real u-axis. When F(u) is even and analytic, usually only the k = 1
term in (22) is important if h is small. In addition, if a > 0, E and
$I(2\pi h^{-1} - a)$ are usually of the same order of magnitude.
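
The relation (22) can be verified numerically on a case where I(a) is known in closed form. The sketch below uses the Gaussian $F(u) = \exp(-u^2/2)$, for which $I(a) = \sqrt{2\pi}\exp(-a^2/2)$. The test function, the deliberately coarse spacing h = 1.5, and the function names are assumptions made for this illustration and are not taken from the paper.

```python
import math


def F(u):
    """Even test function with a known Fourier integral."""
    return math.exp(-0.5 * u * u)


def I_exact(a):
    """I(a) of (19) for the Gaussian: sqrt(2 pi) exp(-a^2 / 2)."""
    return math.sqrt(2.0 * math.pi) * math.exp(-0.5 * a * a)


def trapezoidal_sum(a, h, n_max=200):
    """h * sum over n of F(nh) exp(i a n h), using the evenness of F."""
    total = F(0.0)
    for n in range(1, n_max + 1):
        total += 2.0 * F(n * h) * math.cos(a * n * h)
    return h * total


def predicted_error(a, h, k_max=20):
    """Right-hand side of (22), truncated at k_max terms."""
    return sum(
        I_exact(2.0 * math.pi * k / h - a) + I_exact(2.0 * math.pi * k / h + a)
        for k in range(1, k_max + 1)
    )


h, a = 1.5, 1.0
observed = trapezoidal_sum(a, h) - I_exact(a)
print(observed, predicted_error(a, h))
```

Because this I(a) is real and nonnegative, the observed error is positive, and it is dominated by the k = 1 term $I(2\pi h^{-1} - a)$, as the text states.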


VIII. DISPLACING THE PATH OF INTEGRATION OF A FOURIER INTEGRAL


In some cases, the numerical evaluation of the Fourier integral I(a)
defined by (19) can be facilitated by shifting the path of integration
from the real u-axis to a parallel path that passes through u = ic,
where c is some suitably chosen real constant. Changing the variable
of integration in the integral (19) for I(a) from u to x, where u = x + ic,
and applying the trapezoidal rule to the integral in x gives the
trapezoidal error

$$E' = h\sum_{n=-\infty}^{\infty}F(nh+ic)\,e^{ianh-ac} - I(a),$$

which, by Poisson's summation formula, goes into

$$E' = \left(\sum_{k=-\infty}^{-1}+\sum_{k=1}^{\infty}\right)I(2\pi kh^{-1}+a)\,e^{2\pi kc/h}. \tag{23}$$

When I(a) is known to be real and nonnegative for all real values
of a, then E' is nonnegative just as E is in (21), and the trapezoidal
values of I(a) obtained from the displaced path are greater than or
possibly equal to the true value of I(a), just as before.

The integral in the expression (18) for J(b, a) is of the Fourier type
(19), and some calculations were made to test the benefit obtained by
shifting the path of integration. Along the path u = x + ic, the real
part of the integrand is an even function of x, and the integral in (18)
can be rewritten as an integral from x = 0 to ∞ of twice the real
part. For b = 32 and a = 10, the value of c was chosen to be 0.87. This
corresponds to the saddle point $u_0 = i0.87$ of $\exp(bu^{-1}\sin u + iau)$.
A trapezoidal rule evaluation with h = 0.3 along the line u = x + i0.87
gave J(b, a) to the accuracy shown in Table I. The number of terms
required was N = 16 compared with N = 26 when the integration
was taken along the real u-axis (with h = 0.175).


Waves Through Rain

An experiment was designed to determine the effect of rainfall upon the
attenuation and polarization of millimeter waves. Vertically polarized 
waves at 60 GHz were transmitted over a 1-km path, and the energy re- 
ceived in both the vertical and horizontal polarizations, as affected by rain, 
was detected and recorded at the receiving site; similarly, a horizontally 
polarized wave was transmitted and the same components measured. Data 
were collected over a period of one year (1970) using an all-solid-state mea- 
suring set. It was found that attenuation due to rain exceeded 42 dB for 
5 minutes, and that a system employing orthogonally polarized signals 
would have had to tolerate signal-to-crosstalk ratios as poor as 22 dB for 
that same period of time. 


I. INTRODUCTION


Crowding of the frequency spectrum is forcing designers of micro- 
wave communication systems to use higher frequencies and to give 
serious consideration to orthogonally polarized waves for increasing 
communication capacity. If adequate orthogonality can be maintained, 
the capacity of a system is double that of a system employing only one 
polarization. However, it has long been known that imperfections in 
antennas and other system components limit the degree of ortho- 
gonality that can be maintained in a practical system. Rainfall on the 
transmission path can also affect polarization and thereby limit the 
degree of discrimination against the unwanted signals at the receiver. 
The experiment described here was set up to determine the extent of
such depolarization as well as the attenuation of 60-GHz waves.

The mechanism of attenuation and depolarization involves the 
geometry of raindrops. A number of workers have determined that 
large raindrops are seldom spherical and that they are capable of 
producing different amounts of attenuation for waves having different 
polarizations.!? At the time our experiment was in the design stage,
Semplak? had already observed, at a frequency of 30.9 GHz, that
attenuation for horizontally polarized waves was significantly greater 
than for vertically polarized waves during rainstorms. In this paper, 
we report data at about twice that frequency. 


II. DESCRIPTION OF THE EXPERIMENT


The propagation path at Holmdel, New Jersey, is about 1.03 km 
long. The transmitting antenna is located on flat terrain about 30 
feet above the ground and the receiver is on the southern slope of 
Crawford Hill. Propagation is along a path approximately 15 degrees 
west of true north. The transmitted carrier frequency is 60 GHz, near 
the maximum of the oxygen absorption line.‘ A horn-reflector antenna 
with 51-dB gain is employed at the transmitter and a horn-lens 
antenna with 53-dB gain at the receiver. The net clear-weather path 
attenuation is 39 dB, which includes a 15-dB loss due to oxygen 
absorption. 
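
The quoted 39-dB clear-weather figure is consistent with a simple link budget. The sketch below assumes the standard free-space loss formula 20 log(4πd/λ) and combines it with the antenna gains and oxygen loss given above. The function names are illustrative, and the calculation is a plausibility check rather than the authors' own computation.

```python
import math

SPEED_OF_LIGHT = 299_792_458.0  # m/s


def free_space_loss_db(freq_hz, distance_m):
    """Free-space loss between isotropic antennas, 20 log10(4 pi d / lambda)."""
    wavelength = SPEED_OF_LIGHT / freq_hz
    return 20.0 * math.log10(4.0 * math.pi * distance_m / wavelength)


def clear_weather_attenuation_db(freq_hz, distance_m, tx_gain_db, rx_gain_db, oxygen_loss_db):
    """Net path attenuation: free-space loss minus antenna gains plus oxygen loss."""
    return free_space_loss_db(freq_hz, distance_m) - tx_gain_db - rx_gain_db + oxygen_loss_db


# 60 GHz, 1.03 km, 51-dB and 53-dB antennas, 15-dB oxygen absorption.
net = clear_weather_attenuation_db(60e9, 1030.0, 51.0, 53.0, 15.0)
print(round(net, 1))
```

The result is within a fraction of a decibel of the 39 dB stated in the text.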

A single transmitter and receiver are employed. The polarization of 
the transmitted wave is switched from horizontal to vertical 10 times 
per second and the signal is gated on and off at a 20-Hz rate. The re- 
ceiver is switched to respond five times per second, first to a horizontally 
polarized wave, and then to a vertically polarized wave. As a result 
of the switching we provide four separate outputs corresponding to the
following conditions of polarization:


(i) Transmitter and receiver both horizontal, designated T_hR_h.
(ii) Transmitter and receiver both vertical, designated T_vR_v.
(iii) Transmitter horizontal and receiver vertical, T_hR_v.
(iv) Transmitter vertical and receiver horizontal, T_vR_h.*


Output signals corresponding to these four conditions are recorded 
separately and constitute our data. The effects we measure are the 
differences in the fading suffered by the various components during 
rainstorms. 

The pulse output of each gate is detected and integrated, applied 
to a logarithmic amplifier, amplified, and recorded. It was not difficult 
to obtain a truly logarithmic output over an 80-dB range of input 
voltage, but it was difficult to obtain a truly linear detector output. 
By using one detector for the direct components and a separate one 
for the cross-polarized components, we are able to accommodate 50-dB 
fades with the recorded output a truly logarithmic function of the input 
over the complete range. 


*In this nomenclature, for example, a cross-polarization discrimination is T,R,/ 
TR, and a crosstalk ratio is T,R,y/T,Ro. 


166 THE BELL SYSTEM TECHNICAL JOURNAL, JANUARY 1975 


III. EQUIPMENT


The fact that we are dealing with a narrowband measuring set in 
an application that requires a dynamic range of about 80 dB puts a 
premium on frequency stability. A considerable part of our effort, 
therefore, was directed toward obtaining such stability. 


3.1 Receiver 


We chose to injection-lock the receiving local oscillator to a stable 
30-GHz signal obtained by harmonic generation from a 150-MHz 
source to obtain both a high degree of frequency stability and a low- 
noise output. To provide means for injecting the locking signal, the 
oscillator cavity shown in Fig. 1 was constructed. The IMPATT diode
is mounted at one end of the center conductor of this coaxial cavity 
which is one wavelength long at 60 GHz. A probe, one-quarter wave- 
length from the end, couples out 60-GHz energy into a section of the 
RG 98/U output waveguide. The 30-GHz energy is coupled from the 
RG 96/U injection waveguide into the cavity at a point one-half 
wavelength (at 60 GHz) from the end of the cavity. Since this is at 
a potential null for the higher frequency, coupling can be accomplished 
with minimum reaction to the 60-GHz signal. The 30- and 60-GHz 
waveguide sections are tuned by means of short-circuiting plungers 
adjacent to the coupling probes. From an inspection of Fig. 1 it is 
evident that oscillation and locking could take place at 30 GHz and 
produce a 60-GHz output due to the generation of second harmonic. 
Alternatively, 30-GHz oscillation could be suppressed by adjustment
of the coupling probe, leaving the circuit free to oscillate at 60 GHz
and lock on the second harmonic of the 30-GHz signal. The latter mode
of operation proved most favorable.*

Fig. 1. 60-GHz IMPATT oscillator. Diode is mounted at the end of the center
conductor of the coaxial cavity.

Fig. 2. 60-GHz solid-state receiver with injection-locked local oscillator.

The IMPATT local oscillator in the receiver is locked to a signal
generated from the output of the 150-MHz, crystal-controlled oscillator
(see Fig. 2). A high-level 300-MHz voltage is applied to a step-recovery
diode which has an output tuned to 15 GHz, thus providing a fre- 
quency multiplication of 50. The final locking signal, at a level of 0.2 
mW, is obtained by doubling the 15-GHz signal in a diode harmonic 
generator. The locking range with this power level is known to be a 
few tens of MHz, which is sufficient to override cavity detuning due to 
temperature changes. Output power of the local oscillator of the 
receiver was about 4 mW. 

*In a preliminary experiment, 3 mW from a tunable Gunn oscillator was applied
to the 30-GHz input to the oscillator, which delivered about 15 mW at 60 GHz. The
IMPATT output frequency could be varied over a range of plus and minus a few
hundred MHz by adjusting the frequency of the Gunn oscillator but leaving the
IMPATT tuning fixed, resulting in a very satisfactory frequency lock.

For favorable adjustment, the FM noise output of the oscillator,
when injection locked, was found to come largely from the injection
signal. With a frequency multiplication of 400, this noise is 52 dB above
that of the 150-MHz oscillator. However, the most frequently observed
situation was one in which the IMPATT was noisy and the
noise output was reduced by application of the locking signal. The
cavity adjustments that resulted in stable, low-noise operation were
obviously different from those that gave the widest frequency lock;
the former condition was used in operation.
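
The 52-dB figure follows from the usual rule that multiplying a frequency by N raises FM noise by 20 log N dB. The sketch below works through the chain. The split into individual stages is an assumption pieced together from the text (a doubling from 150 to 300 MHz is inferred, the factor of 50 and the doubling to 30 GHz are stated, and the final factor of 2 is the second-harmonic lock).

```python
import math

# Assumed multiplier chain from the 150-MHz crystal oscillator to 60 GHz.
stages = [2, 50, 2, 2]  # 150 MHz -> 300 MHz -> 15 GHz -> 30 GHz -> 60 GHz

total_multiplication = math.prod(stages)
noise_increase_db = 20.0 * math.log10(total_multiplication)

print(total_multiplication)         # overall frequency multiplication
print(round(noise_increase_db, 1))  # FM noise increase in dB
```

Only the overall factor of 400 matters for the noise estimate; the stage breakdown simply confirms that 150 MHz times 400 gives 60 GHz.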

In the receiver the local-oscillator output is applied to a balanced 
converter where the incoming signal is translated to an intermediate 
frequency of 160 MHz. The converter employs wafer-type, silicon, 
point-contact rectifiers of the type described by Sharpless.® A transistor 
preamplifier with 10 dB of gain and a 2-dB noise figure is coupled 
directly to the converter. Although extensive measurements have not 
been made, there is evidence that local-oscillator noise is not limiting 
the performance of our system. The 160-MHz signal is mixed with 150 
MHz in a second converter to provide a 10-MHz second intermediate 
frequency. After amplification, this IF signal is detected by linear
detectors. 


3.2 Transmitter 


With only 0.2 mW of 30-GHz locking signal available, it was not 
possible to obtain the high output power needed from the transmitter 
and at the same time have a really satisfactory frequency lock. For 
this reason, the transmitter is controlled by an AFC circuit using a
harmonically-generated 60-GHz signal as reference (see the block 
schematic in Fig. 3). With a feedback factor of nearly 1000, the trans- 
mitter output frequency is determined almost entirely by the cross- 
over point of the 160-MHz discriminator and, therefore, is largely 
independent of changes in cavity tuning. 


3.3 Performance 


Before the system was installed in the field, the transmitter output 
was coupled to the receiver and the resultant 160-MHz IF signal 
monitored with a frequency counter. The frequency stayed within a 
few tens of kilohertz over a period of many days. 

In the low-frequency section of the receiver, a logarithmic amplifier 
of the type described by Paterson,® simple and easily adjusted, pro- 
vided a truly logarithmic output over a range of 80 dB. Obtaining a 
linear detector to drive this amplifier was more of a problem, but by 
amplifying the 10-MHz signal to a level of about 50 volts at the 
detector, a linear output was obtained over an input range of more 
than 50 dB. The required 80-dB dynamic range was obtained by
dividing the incoming signal into two branches, one to recover low-level
cross-polarization signals and the other to process the high-level
signals.

Fig. 3. 60-GHz solid-state transmitter with frequency-controlled oscillator.

Fig. 4. Polarization discrimination characteristics of the operating system.

Fig. 5. Polarization discrimination with respect to stability. Not plotted for
weekends.


3.4 Polarization discrimination and stability 


After preliminary adjustments of the system on the path, a polariza- 
tion shifter was placed in the receiver input to rotate the plane of 
polarization of the incoming waves and thereby determine the polariza- 
tion discrimination of the system. The resultant outputs, plotted on 
Fig. 4, show that an excellent balance could be obtained for the 
condition 7,R,/T,R, but, as a result of system imperfections in the 
antennas and other system components, the maximum discrimination 
for the condition 7,R;/T,R, was only 32 dB, which indicates that 
slight elliptical polarization was present in the latter case. The system 
is normally adjusted to operate near the minimum of the horizontally 
cross-polarized component (see dashed line in Fig. 4), which at the 
same time provides more than 30 dB of discrimination against the 
vertically polarized component. Due to temperature or voltage changes, 
the operating point varied somewhat with time, as shown in Fig. 5. 
The gaps in the plotted data result from readings of stability not 
being taken during weekends.* Although no particular steps were taken 
toward stabilization, the discrimination ratio remained better than 
30 dB over long periods of time without readjustment. 


*This remark applies only to stability readings; all other data were accumulated 
continuously. 




IV. EXPERIMENTAL RESULTS
4.1 Distribution of fades 


Although the primary purpose of the experiment was to obtain data 
concerning polarization, it also yielded one year of information on the 
distribution of rain fades. Figure 6 is a plot showing depth of fade 
versus the fraction of time this fading level was exceeded. We see, for 
this year, that for 60-GHz transmission over this 1-km path, a rain- 
fading margin of 40 to 42 dB would have been needed to meet a 
reliability objective of not more than 5 minutes of outage time (a 
probability of 10⁻⁵). These data are found to be in good agreement
with attenuations predicted from point rainfall rates.’ 
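
The link between "5 minutes of outage" and "a probability of 10⁻⁵" is simple arithmetic over a one-year observation period. The short sketch below makes it explicit. The function name is illustrative, and a 365-day year is assumed.

```python
MINUTES_PER_YEAR = 365 * 24 * 60  # assuming a 365-day year


def outage_probability(outage_minutes_per_year):
    """Fraction of the year during which the fade margin is exceeded."""
    return outage_minutes_per_year / MINUTES_PER_YEAR


p = outage_probability(5)
print(f"{p:.2e}")  # close to 1e-5
```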


4.2 Polarization crosstalk distribution 


The experimental results verify expectations that rainfall does 
degrade the polarization of 60-GHz waves, although much less than 
at lower frequencies.” The effect for this path is plotted in Fig. 7. The 
solid dots indicate crosstalk into the vertical channel (T_hR_v/T_vR_v),
and the open circles indicate crosstalk into the horizontal channel
(T_vR_h/T_hR_h). It is evident that, for our system, the polarization
crosstalk ratio is 22 dB or more for 5 minutes during the year 1970.

Figure 8 shows data for a heavy rainstorm of July 31, 1970. This
storm behaved as expected in the sense that all quantities varied in
the way one might expect from simple theory; i.e., differential
attenuation was always positive (fade in horizontal polarization was greater
than in vertical). The crosstalk variations were much the
same for both polarizations, with the ratio becoming poorer during
the deeper fades.

Fig. 6. Distribution of rain attenuation (1970).

Fig. 7. Distribution of the crosstalk ratio.

On the other hand, the storm of November 4, 1970, plotted in Fig. 9, 
did not produce results expected from the simple theory. For a con- 
siderable portion of the time the differential attenuation was negative, 
indicating that attenuation of the vertical component was greater than 
that of the horizontal component. The fact that the crosstalk ratio in 
both channels (Fig. 9) improved slightly in this case may be explained 
by referring to Fig. 4, which shows the clear-weather operating point 
near +0.5 degree; a negative rotation (caused by the rain) of the 
vertical component from this value would reduce the cross-coupled 
energy and thereby improve the ratio. Similarly, a positive rotation of 
the horizontal component would improve the crosstalk ratio in that 
channel. 

One expects differential attenuation at 60 GHz to increase with the 
absolute value of attenuation (much less than at lower frequencies).? 
This appears to be true on the average, as seen in Fig. 10. For fading 
levels greater than 25 dB, the differential appears to remain constant,
but the limited amount of data available for very large fades makes
determination of the trend uncertain. The data points are very scattered,
most likely due to variations of factors such as size, shape, and tilt of
the raindrops involved. A number of negative differential attenuations
are found for fades up to about 17 dB, but none above that value. The
solid curve in Fig. 10 is drawn through the average value of the
differential. A more significant quantity might be the maximum values,
shown by the dashed curve; the maxima seldom exceed 2 dB, which is
much less than the 6 dB observed? at 30 GHz for the same magnitude
of fading.

Fig. 8. Behavior of the various polarization components for the shower of
July 31, 1970.

Fig. 9. Behavior of the various polarization components for the shower of
November 4, 1970.


V. CONCLUSIONS 


Consideration of propagation measured over a period of one year 
on a 1-km path in New Jersey, at a frequency of 60 GHz, leads to the 
following conclusions concerning this path and this frequency: 


(i) Aside from polarization effects, a rain-fading margin of 40 to
42 dB would have been needed to meet a reliability objective
of less than 5 minutes outage per year on a 1-km path.

(ii) The 60-GHz attenuation produced by rain is usually, but not
always, greater for horizontally polarized than for vertically
polarized waves; the differential is seldom greater than 2 dB
and the average differential is only 1.25 dB, even for fades
greater than 30 dB.

(iii) Rainstorms affect the polarization of 60-GHz waves transmitted
through them, although not nearly as seriously for a
given fade depth as at lower frequencies. A system accommodating
a crosstalk interference ratio of 22 dB would operate
satisfactorily over this path, suffering no additional outage over
that resulting from attenuation.

Fig. 10. Plot of measured differential attenuation as a function of fade depth.


Rain-Scatter Interference in Terrestrial
Microwave Systems


By K. BULLINGTON

The potential interference caused by scattering from raindrops is viewed
as foreground reflections that "spoil" the off-beam antenna pattern. This
approach greatly simplifies interference calculations and is sufficiently 
accurate for most engineering purposes. 

The principal uncertainty that limits not only this simplified approach 
but also the “‘exact’’ solution is the inability to characterize the variable 
rainfall distribution along any given radio path by a single parameter, 
such as the rainfall rate, either at a point or as an average along the path. 


I. INTRODUCTION


The potential cochannel interference between two stations, illustrated
in Fig. 1a, is usually assumed to occur along the shortest
path (dotted line) between stations. The transmission loss along the 
dotted path includes not only the free-space loss, but also any addi- 
tional loss caused by obstructions or by site shielding. Transmission is 
further modified by the antenna patterns in the direction toward the 
other site. In addition, reflections from nearby trees and terrain may 
reduce the expected suppression of the antenna in directions away 
from the main beam. 

If the two antenna beams intersect, any reflection from rain, aircraft,
or birds in or near their common volume can provide another
path by which interference can occur. While the magnitude of such
reflections is normally small, this "sneak" path has free-space
transmission with full antenna gain at one end and, thus, bypasses the
suppression that is normally provided by the antenna patterns and 
shielding losses. 


II. APPROACH


A simplified procedure for estimating the transmission loss by way
of reflection from rain can be obtained, in most cases, by replacing
Fig. 1a with Fig. 1b. In this case, antenna G is replaced by a phantom
antenna, Gi, whose equivalent gain relative to an isotropic antenna is
given on Fig. 2 for various frequencies and rates of rainfall. The
phantom antenna, Gi, can be considered to be located at the beam
intersection, which is at a distance r2 from G2.

Fig. 1. (a) Plan view of two cochannel stations. (b) Interference path from common
volume.

In other words, the effect of rain is to fill in the nulls and reduce the
normal side- and back-lobe suppression of the antenna, and then to
raise it to within line of sight of the distant station. Even when all of
the energy from the transmitter is scattered, Gi cannot exceed the
unity "gain" of an isotropic antenna. This means that for comparable
antennas, distances, and transmitter power, the unwanted scattered
signals will be less than the desired signal by the gain of the antenna
nearest the heavy rainstorm.

Fig. 2. Effective antenna gain Gi during rain.

If the two antenna beams do not intersect, the antenna gain Gi is
reduced in accordance with its directivity pattern for the off-beam
angle between its axis and the direction needed for full intersection.

The use of Fig. 2 requires an estimate of the rainfall rate R to be
expected for a given percentage of the time. Typical values in current
use that are expected to be exceeded for no more than 10⁻⁴ of the time
are given in the following table.¹ For a probability of 10⁻⁵, the tabulated
values of 10 log Z should be increased by about 6 dB.


Section of USA             R (mm/hr)    10 log Z* (dB)
Southeastern and Gulf         80            53
East and Midwest              56            51
Pacific region                33            47.5
Rocky Mountain region         13            41


* Reflectivity factor $Z \approx 10^5 (R/50)^{1.5}$.
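
The tabulated values of 10 log Z can be reproduced from the footnote formula. The sketch below assumes the approximation $Z \approx 10^5 (R/50)^{1.5}$ with R in mm/hr. The function name and the dictionary of regions are illustrative only.

```python
import math


def reflectivity_db(rain_rate_mm_per_hr):
    """10 log10 Z for the approximation Z = 1e5 * (R / 50)**1.5."""
    z = 1e5 * (rain_rate_mm_per_hr / 50.0) ** 1.5
    return 10.0 * math.log10(z)


# (rain rate in mm/hr, tabulated 10 log Z in dB)
table = {
    "Southeastern and Gulf": (80, 53.0),
    "East and Midwest": (56, 51.0),
    "Pacific region": (33, 47.5),
    "Rocky Mountain region": (13, 41.0),
}

for region, (rate, tabulated_db) in table.items():
    print(f"{region}: computed {reflectivity_db(rate):.1f} dB, tabulated {tabulated_db} dB")
```

The computed values agree with the table to within a few tenths of a decibel, which is well inside the uncertainty in the rain rate itself.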


While Fig. 2 provides a first-order solution that is sufficiently 
accurate for most engineering purposes, the complete rain-scattering 
problem is, of course, much more complex. It is necessary to consider 
the more general problem so that the quantitative significance of the 
many secondary factors can be evaluated and included whenever it 
is necessary to do so. Several solutions to the more complete scattering 
are given in Refs. 2 through 4. In spite of the variety of methods, 
symbols, and units, all of the papers referenced lead to essentially the 
same quantitative results for the same set of assumptions, because all 
use the same meteorological premises. The seeming complexity and 
diversity result in part from the three-dimensional geometry and in 
part from the inclusion of several factors that have been neglected in 
the first-order solution in Fig. 2 because their quantitative effects are 
small. 

Atmospheric absorption is omitted from the first-order engineering 
solution because it is an unnecessary complication at 4 and 6 GHz; this 
loss can be included, if needed, at the end of the computation. To some 
extent, the omission of attenuation caused by rain is compensated for 
by the asymptotic behavior of the curves shown on Fig. 2. These 
curves include a factor to ensure that $G_1' < 1$. More complicated 
methods (that theoretically are more accurate) add the expected 
absorption loss and then offset it in part by a gain due to "forward 
scatter." The difference between the two procedures is expected to be 
less than the effect of the uncertainty in the effective rain rate for a 
specified location and percentage of the time. 

The polarization-mismatch factor is an additional loss of about 
$20 \log \cos \phi$ dB, where $\phi$ is the angle between the polarizations of the 
desired and undesired signals. It is a safety factor that can ordinarily 
be neglected, because the accuracy of the other factors in the overall 
computation seldom warrants this precision. Some methods add a 3- or 
4-dB loss for this factor, but any assumption of a significantly larger 
amount (say, 20 to 25 dB because the interfering polarizations are 
expected to be orthogonal) seems unrealistic because the normal cross- 
polarization discrimination can be significantly reduced during rain. 

In addition, some references include a dielectric factor 
$[(\epsilon - 1)/(\epsilon + 2)]^2$. This factor is negligible for rain but is automatically 
included in the factor Z used in Fig. 2. 


Example 


When the two stations illustrated in Fig. 1a are separated by z = 30 
miles (50 km), the free-space loss between isotropic antennas is 138 dB 
at 4 GHz. The net transmission loss along the z direction is 
$138 - G_{1z} - G_{2z} + S$, where S is the shadow and shielding loss in 
dB and the antenna gains $G_{1z}$ and $G_{2z}$ (along the z direction) are also 
expressed in decibels. On the other hand, the coupling loss from 
intersecting beams illustrated in Fig. 1 is $138 - G_1' - G_2 + 20 \log (r_2/z)$, 
where $G_1'$ is the equivalent antenna gain shown on Fig. 2 for the 
station nearest the rainstorm and $G_2$ is the antenna gain along $r_2$ of the 
more distant station. With 40-dB antennas at 4 GHz, z = 50 km, 
$r_2$ = 62 km, and a rainfall rate of R = 100 mm/hour, the coupling 
loss by way of rain scatter is $138 - (-25) - 40 + 2 = 125$ dB. 
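
The arithmetic of this example can be retraced with a short sketch. The function names are hypothetical, the free-space term uses the standard isotropic expression $20 \log (4\pi d/\lambda)$, and the value of −25 dB for the equivalent gain is taken from the example rather than computed here.

```python
import math

SPEED_OF_LIGHT_M_PER_S = 299_792_458.0


def free_space_loss_db(distance_km, freq_ghz):
    """Loss between isotropic antennas: 20 log10(4 pi d / wavelength)."""
    wavelength_m = SPEED_OF_LIGHT_M_PER_S / (freq_ghz * 1e9)
    return 20.0 * math.log10(4.0 * math.pi * distance_km * 1e3 / wavelength_m)


def rain_scatter_coupling_loss_db(free_space_db, g1_eff_db, g2_db, r2_km, z_km):
    """Coupling loss = L_fs - G1' - G2 + 20 log10(r2 / z), all terms in dB."""
    return free_space_db - g1_eff_db - g2_db + 20.0 * math.log10(r2_km / z_km)


if __name__ == "__main__":
    # The text rounds the 50-km, 4-GHz free-space loss to a whole decibel.
    l_fs = round(free_space_loss_db(50.0, 4.0))
    loss = rain_scatter_coupling_loss_db(l_fs, -25.0, 40.0, 62.0, 50.0)
    print(f"free-space loss ~ {l_fs} dB, rain-scatter coupling loss ~ {loss:.1f} dB")
```

The distance term $20 \log (62/50)$ comes to a little under 2 dB, which is the "+ 2" in the example.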

In calculating the data in Fig. 2 it was assumed that the area of 
maximum rainfall has an effective diameter D = 1 km; if the user 
prefers some other value, say, D’ in km, the rain rate to be used in 
entering Fig. 2 should be R’ = RD’. The model also assumes 
$(G_1/r_1^2) \ge (G_2/r_2^2)$, but this is no restriction because of reciprocity. 


II. DISCUSSION OF THE SCATTER PROBLEM 


The remaining sections review the scattering equation that leads to 
the results given in Fig. 2. In addition to the rain-scatter problem, the 
general method given below also derives the radar equation, the ex- 
pression for passive repeaters, and the equation for rough or smooth 
ground reflections. The free-space transmission loss for a path length r 
and a wavelength $\lambda$ is given by 

$$\frac{P_R}{P_T} = \frac{G_1 A_2}{4\pi r^2} = \frac{G_1 G_2 \lambda^2}{(4\pi r)^2} = \frac{A_1 A_2}{(\lambda r)^2}, \tag{1}$$

where G is the antenna gain and A is the area of the antenna aperture. 
The transmission loss $P_R/P_T$ by way of reflection from a large plane 
surface, illustrated in Fig. 3, is the product of two free-space paths. 
Fig. 3—Reflection from passive repeater. 


The first section has an effective receiving area (cross section) denoted 
by $\sigma$ and the second section has an effective transmitting antenna gain 
g that is related to both the cross section $\sigma$ and the degree of both 
smoothness and reflectivity of the plane reflector. By algebraic 
rearrangement, the expression is separated into the free-space loss to 
the image of the distant antenna and a power reflection coefficient: 


[Equation (2) is not legible in the source; its two bracketed factors are labeled "Free space" and "Reflected."] 

In this expression, 

$K = 1$ for a perfectly smooth surface (mirror), 
$K \approx \exp(-\phi)$ for irregular surfaces, 
$K \to 0$ for a very rough surface, 


and 


$\phi$ = standard deviation of phase variations caused by irregular 
surface. 


By the simplifying assumptions that ordinarily apply to radar 
($G_1 = G_2 = 4\pi A_0/\lambda^2$; $r_1 = r_2 = r$; $g = 1$; $K < 1$), eq. (2) reduces to 
the radar equation 

$$\frac{P_R}{P_T} = \frac{G_1^2 \lambda^2 \sigma}{(4\pi)^3 r^4}. \tag{3}$$

When both dimensions of the plane reflector in Fig. 3 are large 
compared with the first Fresnel zone ($\sqrt{\lambda d_0}$), the cross section $\sigma = \lambda d_0$. 
For this case, the transmission loss is the free-space loss along the 
image path multiplied by a power-reflection coefficient: 

$$\frac{P_R}{P_T} = \left( \frac{G_1 A_2}{4\pi (r_1 + r_2)^2} \right) K.$$

When the y dimension of the plane reflector is less than the first 
Fresnel zone, $\sigma = y\sqrt{\lambda d_0}$, and the corresponding transmission loss is 

[Equation not legible in the source.] 

Finally, when both dimensions of the reflector are small compared with 
the first Fresnel zone, $\sigma$ approaches the geometrical cross section of the 
reflector, and the corresponding transmission loss is 

[Equation not legible in the source.] 


In all of these cases, the first bracketed term is the free-space loss 
along the image path. 

When the cross-sectional area $\sigma$ of a single reflector is small compared 
with $\lambda^2/4\pi$ (or its surface is so rough that $K \to 0$), the energy is 
scattered in all directions. The antenna gain of the scatterer [see (2)] 
becomes g = 1, and this leads to 

$$\frac{P_R}{P_T} = \left( \frac{G_1 \sigma}{4\pi r_1^2} \right) \left( \frac{A_2}{4\pi r_2^2} \right) = \left( \frac{A_2}{4\pi r_2^2} \right) \left( \frac{\sigma}{A_1} \right), \tag{5}$$

where 

$A_1 = 4\pi r_1^2 / G_1 = r_1^2 \theta_1 \phi_1$ = cross-sectional area of beam from antenna 1 at distance $r_1$ 

and 

$\theta_1, \phi_1$ = antenna beam widths in radians (assumed uniform). 


The assumption of isotropic scattering is not strictly correct, but 
Setzer’ has shown that, for frequencies below 30 GHz, the difference 
between isotropic scattering and the exact value is less than 3 dB at all 
angles. 

Fig. 4—Geometry of common volume. 

The next step toward solution of the rain-scatter problem is to 
assume a large number (N) of small particles, each with a cross 
section $\sigma \ll \lambda^2/4\pi$, randomly spaced with an approximately uniform 
density throughout the common volume illustrated in Fig. 4. The 
resulting transmission loss is 

$$\frac{P_R}{P_T} \approx \left( \frac{A_2}{4\pi r_2^2} \right) \left( \frac{\sigma}{A'} \right) N V \left( \frac{1}{1 + \sigma N D} \right) = \left( \frac{A_2}{4\pi r_2^2} \right) G', \tag{6}$$

where 

N = number of particles per cubic meter, 
V = common volume A'D, 
$G' = \sigma N D / (1 + \sigma N D)$, 

and 

D = size of rain cell with maximum "uniform" rain rate. 


The first bracket in (6) is the free-space transmission loss for a 
distance $r_2$ between an isotropic antenna and an antenna whose gain 
is $G_2 = 4\pi A_2/\lambda^2$. The term $\sigma N$ is a meteorological parameter (total 
cross section per cubic meter) that depends on the rate of rainfall and 
the wavelength. Essentially, the same assumptions on the distribution 
of raindrop diameters and on their velocity of fall are used by all the 
references with the result that 


$$\sigma N = \frac{\pi^5 10^{-18} Z}{\lambda^4} \ \text{meters}^{-1}, \tag{7}$$

where 

Z = reflectivity factor 
  = $180 R^{1.6}$ from Ref. 2, 
  = $400 R^{1.4}$ from table in Ref. 3, 
  = $200 R^{1.6}$ from Ref. 4, 

and 

R = rainfall rate in mm/hour. 


Since the primary interest is in heavy rains (R > 50 mm/hr), this 
work uses 

$$Z = 10^5 \left( \frac{R}{50} \right)^{1.5}, \tag{8}$$


which is in good agreement with all references in the vicinity of 
R = 50 mm/hour, and splits the difference for much smaller and much 
larger values of R. 

The equivalent antenna gain given in Fig. 2 is based on 
$G' = \sigma N D/(1 + \sigma N D)$ as defined in (6). The parameter D represents 
a somewhat uncertain (or "rubbery") length which, perhaps, can best 
be defined as the smaller of either V/A' or the diameter of the rain 
cell associated with the peak rain rate. One of the references uses 
D = 4 km in calculating its principal quantitative result, another uses 
D = 1 km, and a third uses 0.71 km. In this work, a value of D = 1 km 
is assumed in the results shown on Fig. 2. 

Finally, the factor $1/(1 + \sigma N D)$ has been included to prevent a 
calculated value of G' greater than unity, because such a result would 
be inconsistent with the basic assumption of isotropic scattering. 
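
The chain from rain rate to equivalent gain can be sketched numerically. This is a sketch under stated assumptions, not the procedure used to draw Fig. 2: it assumes that (7) reads $\sigma N = \pi^5 10^{-18} Z/\lambda^4$ per meter with $\lambda$ in meters, uses (8) for Z, and takes D = 1 km. The function names are hypothetical.

```python
import math

SPEED_OF_LIGHT_M_PER_S = 299_792_458.0


def reflectivity_factor(rain_rate_mm_per_hr):
    """Eq. (8): Z = 1e5 * (R/50)**1.5."""
    return 1e5 * (rain_rate_mm_per_hr / 50.0) ** 1.5


def scatter_cross_section_per_m(rain_rate_mm_per_hr, freq_ghz):
    """Assumed reading of eq. (7): sigma*N = pi^5 * 1e-18 * Z / wavelength^4."""
    wavelength_m = SPEED_OF_LIGHT_M_PER_S / (freq_ghz * 1e9)
    z = reflectivity_factor(rain_rate_mm_per_hr)
    return math.pi ** 5 * 1e-18 * z / wavelength_m ** 4


def equivalent_gain_db(rain_rate_mm_per_hr, freq_ghz, cell_diameter_km=1.0):
    """G' = sigma*N*D / (1 + sigma*N*D), expressed in dB."""
    snd = scatter_cross_section_per_m(rain_rate_mm_per_hr, freq_ghz) * cell_diameter_km * 1e3
    return 10.0 * math.log10(snd / (1.0 + snd))


if __name__ == "__main__":
    print(f"G' at 4 GHz, R = 100 mm/hr: {equivalent_gain_db(100.0, 4.0):.1f} dB")
```

With these assumptions the 4-GHz, 100-mm/hour case comes out near −25.6 dB, close to the −25 dB read from Fig. 2 in the example. The saturation factor keeps G' below 0 dB for any rain rate.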


IV. RAIN ATTENUATION 


As previously mentioned, the first-order solution (using Fig. 2) 
considers only the scattering and omits the attenuation caused by 
rain. The attenuation computed by Setzer is shown by the solid lines 
on Fig. 5 and agrees with values given previously by Medhurst.®.’ 
The dotted lines are calculated from the following empirical formula, 
which may be useful for engineering purposes: 


[Equation (9), an empirical expression for the attenuation in dB/km, is not legible in the source.] 

where 

$\lambda$ = wavelength in cm. 

Fig. 5—Rain attenuation. 


As in the scattering problem, there is some uncertainty regarding 
the effective distance to be used with the data in Fig. 5. The distance to 
be used in computing attenuation is greater than the diameter of a 
rain cell associated with the maximum rate of rainfall but cannot 
exceed the path length. Many authors prefer to think in terms of the 
average rate of rainfall over the actual path length, but this merely 
transfers the uncertainty from an effective distance to the determina- 
tion of the average R. Based primarily on measurements in New Jersey, 
Hogg has related the average rainfall rate along a path to the rainfall 
rate at a point in the same climate.® Typical values to be exceeded for 
less than $10^{-5}$ of the time are given for path lengths in the 1- to 10-km 
range and extended to other climates. The results can be represented 
approximately by the following empirical formula: 


[Equation (10), an empirical relation between the path-average rainfall rate $\bar{R}$ and the point rainfall rate R, is not legible in the source.] 

where 

$\bar{R}$ = average rainfall rate along path d (in km). 

In principle, the rain attenuation can be computed if the number and 
size of drops are known at every point in the space between the two 
antennas at every instant in time. In practice, significant variations 
with time and location are likely because the distribution of the rain- 
storm is completely independent of the radio path. 

Rain attenuation estimates are necessary in calculating the fade 
margin against set noise, but only the difference in attenuation be- 
tween signal and interference affects the carrier-to-interference ratio. 
When the desired signal (C) and the interfering signal (I) are 
attenuated equally, the C/I ratio is unchanged and the 
interference-to-noise (I/N) ratio is less than in the absence of attenuation. When the 
desired signal is attenuated more than the interfering signal (as, for 
example, by heavy rain at a location to the left of the intersection 
illustrated on Fig. 1a) both the C/I and C/N are decreased and either 
one may be controlling. Estimates of outage time caused by rain at- 
tenuation (low C/N) can be obtained without reference to rain scatter 
or other types of interfering signals. Conversely, estimates of the 
probability that a rain-scatter interference will exceed the set noise (or 
some lower limit required by the difference in interfering effects be- 
tween thermal noise and a single frequency tone) can be made inde- 
pendent of the rain attenuation. For an efficient microwave transmis- 
sion system, it is essential that the probability of service degradation 
caused by interference should be less than for set noise alone. Any 
more sophisticated attempt to combine the two separate probabilities 
of outage requires more information on the distribution of rainfall 
with geography and time than is available. 


V. CONCLUSION 


The interference caused by scatter from raindrops is a complex 
problem, as indicated in the references. Some complexity is unavoid- 
able when all pertinent factors are to be included regardless of their 
magnitude. Fortunately, a relatively simple engineering method for 
interference calculation can be obtained by neglecting several factors 
whose magnitudes are small compared with the fundamental uncer- 
tainty in the exact rainfall distribution along the path. 
Optimum Timing Phase for an Infinite Equalizer 


By J. E. MAZO 
(Manuscript received September 14, 1973) 


A digital equalizer for data transmission linearly combines a sequence 
of samples [$b(\tau + kT)$, $k = 0, \pm 1, \ldots$] of the received data wave to 
mitigate the effects of intersymbol interference and noise. A natural question is, 
How will the performance of such a system depend on the timing phase $\tau$? 

We examine this problem in considerable detail for an infinite equalizer 
using a mean-square measure of performance. Excess bandwidth results 
in a significant difference in performance between the best and worst timing 
phase. Under practical noise conditions, it is estimated that the excess 
bandwidth must be down to 1 or 2 percent before the timing effect becomes 
insignificant. With a 10-percent roll-off, a 3-dB penalty can be incurred 
by choosing a bad timing epoch. 

Our main result is that under conditions likely to be encountered on 
channels similar to voiceband telephone channels, the optimum sampling 
instants will be accurately approximated by the consecutive maxima and 
minima of the sine wave that result when an alternating sequence of posi- 
tive and negative pulses (dotting sequence) is transmitted. There are no 
additional local minima of the minimum mean-square error as the timing 
phase is varied. 


I. INTRODUCTION 


We consider a noisy, random, baseband pulse train, 

$$b(t) = \sum_n a_n x(t - nT) + n(t), \tag{1}$$

where x(t) is the channel (or channel front-end filter) impulse response 
and the $a_n$ are independent binary data which take values $\pm 1$ with 
equal probability. The additive zero mean gaussian noise process is 
denoted by n(t). Under very ideal conditions, the pulse x(t) would be 
of the form $(\sin \pi t/T)/(\pi t/T)$, and sampling (1) at time t = kT would 
yield the quantity $a_k + n(kT)$. Under more realistic conditions, the 
brick-wall shape of the above pulse is difficult to approximate in 
practice, and a smoother characteristic in the frequency domain is taken as 
the ideal. Using some excess bandwidth (i.e., the Fourier transform 
extends beyond $\pi/T$ rad/s) still, in principle, allows the ideal set of 
samples expressed by 

$$x(0) = 1, \qquad x(kT) = 0, \quad k \neq 0;$$


but this ideal set is never exactly attained in practice because of 
unknown channel distortion. One possible measure of distortion could be 

$$\frac{\sum_{k \neq 0} |x(kT)|}{|x(0)|}.$$

Actually, since there is now nothing special about choosing t = 0 as 
the sampling instant for x(t), an even more appropriate measure 
would be 

$$\min_{\tau} \frac{\sum_{k \neq 0} |x(\tau + kT)|}{|x(\tau)|}.$$

The above measure allows us to choose a best sampling epoch $\tau$ before 
declaring how badly the signal has been distorted. However, any direct 
consideration of the above type of criterion seems to result in 
considerable mathematical difficulty. Also, in practice, one is more 
interested in the situation where the set of digital samples $\{x(\tau + kT)\}$, 
or rather their noisy versions $\{b(\tau + kT)\}$, are linearly combined by 
an equalizer attempting to undo the channel distortion. It is the 
distortion at the output of the equalizer that is of practical interest. 
Remarkably enough, it is this distortion measured at the output of an 
infinite equalizer that is most easily studied with regard to its properties 
as a function of timing phase. A discussion of this dependence follows. 
We begin by making more precise some of our general remarks. 

The received signal b(t) is to be equalized by passing it through a 
(2N + 1)-tap transversal filter to yield for the signal portion of the 
output a waveform 

$$q(t) = \sum_n a_n h(t - nT), \tag{2}$$

where 

$$h(t) = \sum_{n=-N}^{N} c_n x(t - nT). \tag{3}$$

The coefficients $c_n$ are the tap weights of the equalizer and are adjusted 
to optimize some measure of performance. Since decisions are based 
on the sampled output q(t), one usually discusses the problem by fixing 
a sampling epoch $\tau$ and denotes the sampled values of x(t) and h(t) 
by $x_k$ and $h_k$, respectively. That is, 

$$x(\tau + kT) = x_k, \qquad h(\tau + kT) = h_k = \sum_{n=-N}^{N} c_n x_{k-n}, \qquad k = \ldots, -1, 0, 1, \ldots. \tag{4}$$

We assume throughout that the tap weights are chosen to minimize 
the expression 

$$\mathcal{E}^2 = {\sum_n}' h_n^2 + (h_0 - 1)^2 + \sigma_0^2, \tag{5}$$

where $\sigma_0^2$ is the sampled noise variance at the output of the equalizer, 
and the prime on the summation indicates deletion of the n = 0 term. 
Letting $R_n = R_{-n}$ be the (discrete) correlation function of the sampled 
noise at the input to the equalizer, we see that 

$$\sigma_0^2 = \sum_{j,k=-N}^{N} c_j R_{j-k} c_k \tag{6}$$

and 

$$\mathcal{E}^2 = \sum_{j,k=-N}^{N} c_j (R_{j-k} + A_{j-k}) c_k - 2 \sum_{n=-N}^{N} c_n x_{-n} + 1, \tag{7}$$

where the channel correlation matrix, 

$$A_{nm} = A_{n-m} = \sum_{k=-\infty}^{\infty} x_{k-n} x_{k-m}, \tag{8}$$

has been introduced. The optimum tap settings $(c_{opt})_n$ are determined 
by differentiating $\mathcal{E}^2$ with respect to the tap gains. This yields a linear 
system of equations for the best tap gains which, in an obvious 
matrix-vector notation, reads 

$$(A + R) c_{opt} = x. \tag{9}$$

Using this in (7), we then obtain for the minimum mean-square error 

$$\mathcal{E}^2_{min} = 1 - h_0. \tag{10}$$


Finally, we may state our problem. The above discussion of known 
results was in terms of a fixed timing epoch $\tau$. Here we shall determine 
for some special but important cases the value of $\tau$ that will minimize 
$\mathcal{E}^2_{min}$, as defined above. Instead of resorting to a direct numerical 
inversion of (9), we introduce the approximation of treating the equalizer 
as infinite. This approximation aids the analytical inversion of (9) and 
thereby directly leads to useful insights concerning the proper timing 
epoch. It will be seen that in the $N = \infty$ limit, timing recovery is only a 
problem when the received pulse x(t) has a bandwidth greater than 
1/2T Hz, a result that could have been guessed from the sampling 
theorem. With the practical situation of excess bandwidth, there is the 
possibility that a badly chosen timing phase will considerably 
degrade performance. Only when there is very little excess bandwidth, 
say of the order of one percent, is the timing unimportant in practice. 
Our main result can be best stated in terms of the pure sine wave that 
appears at the channel output (sampler input) when the sequence 
$a_n = (-1)^n$ (dotting sequence) is transmitted. The optimum sampling 
instants are, under certain conditions, the consecutive maxima and 
minima of this wave. Furthermore, if the timing phase is varied, no 
extraneous local minima of the minimum mean-square error (regarded 
as a function of timing phase) will appear. 


II. MATHEMATICAL DETERMINATION OF OPTIMUM TIMING PHASE 


On comparing (8) and (9), we see that the infinite matrix equation is 
in the nature of a convolution and, therefore, is best solved by 
transform methods. Thus, given a sequence $\{x_n\}$, we define a transform 

$$X_{eq}(\omega) = T \sum_n x_n \exp(-i\omega n T), \qquad |\omega| \le \frac{\pi}{T}, \tag{11}$$

with the $x_k$ recoverable from $X_{eq}(\omega)$ in the obvious way: 

$$x_k = \frac{1}{2\pi} \int_{-\pi/T}^{\pi/T} X_{eq}(\omega) e^{i\omega k T}\, d\omega. \tag{12}$$

This formula shows that, from the impulse-response point of view, 
$X_{eq}(\omega)$ is, in fact, the Nyquist equivalent spectrum of x(t). That is, 
if $X(\omega)$ is the Fourier transform of x(t), then 

$$X_{eq}(\omega) = \sum_k X\left(\omega - \frac{2\pi k}{T}\right), \qquad |\omega| \le \frac{\pi}{T}. \tag{13}$$


In this transform language, the solution of (9) is* 

$$C(\omega) = \frac{T X_{eq}^*(\omega)}{R_{eq}(\omega) + \frac{1}{T} |X_{eq}(\omega)|^2}. \tag{14}$$

In writing this, we have made use of (8) to write 

$$A(\omega) = \frac{1}{T} |X_{eq}(\omega)|^2. \tag{15}$$

Equation (4) gives the relation 

$$H(\omega) = \frac{1}{T} C(\omega) X_{eq}(\omega) = \frac{|X_{eq}(\omega)|^2}{R_{eq}(\omega) + \frac{1}{T} |X_{eq}(\omega)|^2} \tag{16}$$

and, hence, from (10) and the recovery formula for $h_0$, we have 

$$\mathcal{E}^2_{min} = \frac{T}{2\pi} \int_{-\pi/T}^{\pi/T} \frac{R_{eq}(\omega)}{R_{eq}(\omega) + \frac{1}{T} |X_{eq}(\omega)|^2}\, d\omega. \tag{17}$$

Let us now explicitly put timing recovery into this solution for $\mathcal{E}^2_{min}$. 

* We will not always use the subscript "eq" on the transform (11) when there does 
not exist an associated impulse response in a natural way. 


In evaluating the above formula, one would naturally calculate $X(\omega)$, 
possibly by choosing t = 0 as the peak of the pulse x(t), and calculate 
$X_{eq}(\omega)$ by (13). Any other timing epoch $\tau$ can be obtained by replacing 
$X(\omega)$ by $X(\omega) \exp(i\omega\tau)$. This corresponds to sampling the original 
x(t) at $t = \tau$ instead of t = 0; in general, $X_{eq}(\omega)$ will depend on $\tau$. 

For further specifics, we limit our discussion to small but nonzero 
excess bandwidth, letting $\alpha$ denote the fraction of excess bandwidth. 
Thus, $(1 + \alpha)\pi/T$ is the total bandwidth (in rad/s) occupied by the 
baseband pulse, $\alpha < 1$. Further, we take the noise at the input to the 
equalizer to be independent from sample to sample and to have 
variance $\sigma^2$; thus, 

$$R_{eq}(\omega) = T\sigma^2. \tag{18}$$
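Equation (17) thus becomes 

$$\begin{aligned}
\mathcal{E}^2_{min} &= \sigma^2 \frac{T}{2\pi} \int_{-\pi/T}^{\pi/T} \frac{d\omega}{\sigma^2 + \frac{1}{T^2} |X_{eq}(\omega)|^2} \\
&= \sigma^2 \frac{T}{\pi} \int_{0}^{(\pi/T)(1-\alpha)} \frac{d\omega}{\sigma^2 + \frac{1}{T^2} |X_{eq}(\omega)|^2}
 + \sigma^2 \frac{T}{\pi} \int_{(\pi/T)(1-\alpha)}^{\pi/T} \frac{d\omega}{\sigma^2 + \frac{1}{T^2} |X_{eq}(\omega)|^2}.
\end{aligned} \tag{19}$$

We note that the first term in the right-hand side of (19) is not affected 
by timing phase since the portion of $|X_{eq}(\omega)|$ for $|\omega| \le (\pi/T)(1 - \alpha)$ 
is not affected, whereas the second term, on account of the foldover 
from excess bandwidth, is affected.* 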

To obtain estimates regarding the magnitude of the effects of poor 
timing phase, we consider the case of an undisturbed pulse x(t) having 
a transform $X(\omega)$ that is equal to T for $0 \le \omega \le (1 - \alpha)\pi/T$ and 
decreases linearly to zero for $(1 - \alpha)\pi/T \le \omega \le (1 + \alpha)\pi/T$, with 
$X(-\omega) = X(\omega)$. If this pulse is sampled at t = 0, the "main bang" 
has value unity, and there is no intersymbol interference. From (19), 
the minimum mean-square error is $\sigma^2/(1 + \sigma^2) \approx \sigma^2$ for small noise. 


* Of course, if there is no noise, the equalizer inverts $X_{eq}(\omega)$ (if the latter is not zero) 
without any penalty, and the question of timing is moot. 
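
If, however, it is sampled at $\tau = T/2$, we have 

$$|X_{eq}(\omega)|^2 = \begin{cases} T^2, & 0 \le \omega \le (1 - \alpha)\pi/T \\ T^2 (\pi - \omega T)^2 / (\alpha\pi)^2, & (1 - \alpha)\pi/T \le \omega \le \pi/T \end{cases} \tag{20}$$

and, hence, with the infinite equalizer doing the best it can, we obtain 
from (19) for small noise 

$$\mathcal{E}^2_{min} = (1 - \alpha)\sigma^2 + \alpha\sigma \tan^{-1} \frac{1}{\sigma} \approx (1 - \alpha)\sigma^2 + \frac{\pi}{2} \alpha\sigma. \tag{21}$$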

If for binary transmission we assume a 10 or 15 percent roll-off ($\alpha = 0.1$ 
or 0.15) and $\sigma^2 = 0.01$ to 0.04, we see that a degradation of from 2 dB 
to 4 dB might be encountered. Only if the roll-off is of the order of 
1 or 2 percent is the effect of timing negligible for the value of noise 
power considered here. 
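
The size of this penalty can be checked numerically. The sketch below is illustrative and rests on stated assumptions: it takes the small-noise expression $(1 - \alpha)\sigma^2 + \alpha\sigma \tan^{-1}(1/\sigma)$ as the reading of (21), compares it with the well-timed value $\sigma^2/(1 + \sigma^2)$, and also integrates (19) directly with T = 1 for the linear roll-off pulse sampled at $\tau = T/2$. The function names are hypothetical.

```python
import math


def mse_good_timing(sigma2):
    """Minimum MSE when the pulse is sampled at t = 0: sigma^2 / (1 + sigma^2)."""
    return sigma2 / (1.0 + sigma2)


def mse_bad_timing(alpha, sigma2):
    """Small-noise closed form assumed for eq. (21), sampling at tau = T/2."""
    sigma = math.sqrt(sigma2)
    return (1.0 - alpha) * sigma2 + alpha * sigma * math.atan(1.0 / sigma)


def mse_bad_timing_numeric(alpha, sigma2, steps=20000):
    """Midpoint-rule evaluation of eq. (19) with T = 1 and |X_eq|^2 from eq. (20)."""
    d_omega = math.pi / steps
    total = 0.0
    for i in range(steps):
        omega = (i + 0.5) * d_omega
        if omega <= (1.0 - alpha) * math.pi:
            x_eq_sq = 1.0                                   # flat part of the spectrum
        else:
            x_eq_sq = ((math.pi - omega) / (alpha * math.pi)) ** 2  # folded roll-off
        total += sigma2 / (sigma2 + x_eq_sq) * d_omega
    return total / math.pi


def timing_penalty_db(alpha, sigma2):
    """Degradation of worst-case timing relative to best-case timing, in dB."""
    return 10.0 * math.log10(mse_bad_timing(alpha, sigma2) / mse_good_timing(sigma2))


if __name__ == "__main__":
    for alpha, sigma2 in [(0.1, 0.01), (0.1, 0.04), (0.15, 0.04), (0.01, 0.01)]:
        print(f"alpha={alpha:4.2f} sigma^2={sigma2:4.2f} "
              f"penalty={timing_penalty_db(alpha, sigma2):4.2f} dB")
```

For a 10 percent roll-off the sketch gives a penalty of roughly 2 to 4 dB over the quoted noise range, and well under 1 dB when the roll-off is 1 percent, in line with the statement above. The closed form and the direct integration differ only by the small-noise approximation in the first term.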

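We now concentrate further attention on the second term of (19). 
We introduce a variable 

$$\mu = \frac{T}{\alpha\pi} \left( \omega - \frac{\pi}{T} \right) \tag{22}$$

for $(\pi/T)(1 - \alpha) \le \omega \le (\pi/T)(1 + \alpha)$; thus, $-1 \le \mu \le 1$. When $\omega$ and 
$\mu$ are related by (22), we define a nonnegative amplitude function $R(\mu)$ 
and a real phase function $\varphi(\mu)$ by the relation 

$$X(\omega) = R(\mu) e^{i\varphi(\mu)}, \qquad R(\mu) \ge 0. \tag{23}$$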


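Further, we introduce the even and odd parts of amplitude and phase 
about the Nyquist frequency $\omega = \pi/T$ ($\mu = 0$): 

$$R(\mu) = R_e(\mu) + R_o(\mu), \qquad R_e(\mu) = R_e(-\mu), \qquad R_o(\mu) = -R_o(-\mu); \tag{24}$$

$$\varphi(\mu) = \varphi_e(\mu) + \varphi_o(\mu), \qquad \varphi_e(\mu) = \varphi_e(-\mu), \qquad \varphi_o(\mu) = -\varphi_o(-\mu). \tag{25}$$

Thus, for $-1 \le \mu \le 0$, we calculate 

$$|X_{eq}(\mu)|^2 = 4[(R_e^2 - R_o^2) \cos^2 \varphi_e + R_o^2], \tag{26}$$

where, for notational simplicity, the $\mu$ dependence of the functions on 
the right has not been shown. Using (26), the second term S of (19) is 
written 

$$S = \alpha\sigma^2 \int_{-1}^{0} \frac{d\mu}{\sigma^2 + \frac{4}{T^2} [(R_e^2 - R_o^2) \cos^2 \varphi_e + R_o^2]}. \tag{27}$$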

A number of remarks concerning (27) are in order. 


(i) The odd component of phase distortion about the Nyquist 
frequency does not influence "foldover" or the timing recovery 
problem. 

(ii) From the positivity of $R(\mu)$, we note that $(R_e^2 - R_o^2) \ge 0$. We 
assume that $[R_e^2(\mu) - R_o^2(\mu)] > 0$ on some interval contained 
in $-1 \le \mu \le 0$. 

(iii) Relative to an arbitrarily selected sampling instant, called zero, 
the timing phase $\tau$ enters $\varphi_e$ through 

$$\varphi_e = \frac{\pi}{T} \tau + \theta_e(\mu), \tag{28}$$

where $\theta_e(\mu)$ is the phase characteristic for $\tau = 0$. It is evident 
that the mean-square error is a periodic function of $\tau$ with 
period T. Physically, if $\tau$ is displaced by an integer (k) 
multiple of T, the "center tap" will be shifted k units down the 
equalizer. 

(iv) In many situations, $\varphi_e$ can, to a first approximation, be treated 
as a constant. A recent survey¹ shows that an "average" 
telephone channel has an envelope delay that changes by 312 μs 
from 2850 Hz to 3000 Hz. Assuming this to be an excess 
bandwidth region and the change to be due entirely to a quadratic 
term in the phase curve yields a phase change of 8.4 degrees.* 
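
The 8.4-degree figure in remark (iv) can be retraced as follows. The sketch assumes that the envelope delay (the derivative of phase with respect to angular frequency) varies linearly across the band, so that the phase contributed by the quadratic term is one half of the delay change times the width of the band in rad/s. The function name is hypothetical.

```python
import math


def quadratic_phase_change_deg(delay_change_s, f_low_hz, f_high_hz):
    """Phase change (degrees) from a purely quadratic phase term.

    With delay = d(phase)/d(omega) rising linearly by delay_change_s over the
    band, the accumulated phase is 0.5 * delay_change * delta_omega.
    """
    delta_omega = 2.0 * math.pi * (f_high_hz - f_low_hz)
    return math.degrees(0.5 * delay_change_s * delta_omega)


if __name__ == "__main__":
    print(f"{quadratic_phase_change_deg(312e-6, 2850.0, 3000.0):.2f} degrees")
```

A 312-μs change over the 150-Hz band gives about 0.147 rad, which is the quoted 8.4 degrees.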


We now come to the main point of this section, which is to recognize 
where the optimum timing phase is. This will be done with the 
simplifying assumption 

$$\theta_e = \theta = \text{const.} \tag{29}$$

Since S is a smooth periodic function of $\tau$, we look for a minimum by 
examining solutions of 

$$\frac{\partial S}{\partial \tau} = 0, \tag{30}$$

that is, 

$$0 = \sin 2\varphi_e \int_{-1}^{0} \frac{(R_e^2 - R_o^2)\, d\mu}{\left\{ \sigma^2 + \frac{4}{T^2} [(R_e^2 - R_o^2) \cos^2 \varphi_e + R_o^2] \right\}^2}. \tag{31}$$

From (ii) we deduce that the integral is strictly positive and, hence, 
the only solutions are those of 

$$\sin 2\varphi_e = 0. \tag{32}$$

* This is meant to be a rough estimate only. No attempt was made to separate out 
even and odd contributions to the change in phase. Low-end envelope delay is more 
severe and this separation should be carried out so as to obtain a not-too-pessimistic 
answer. 

Thus, because of periodicity the only solutions $\tau$ of interest may be 
written 

$$\frac{\pi}{T} \tau + \theta = 0 \tag{33a}$$

or 

$$\frac{\pi}{T} \tau + \theta + \frac{\pi}{2} = 0. \tag{33b}$$

By taking second derivatives of S with respect to $\tau$, we see that the 
first of these, namely (33a), corresponds to the minimum of S while 
the second solution is a maximum. In addition to giving a description 
of optimum timing phase, our discussion has also shown that for 
$\theta_e$ = const. there are no other local minima of $\mathcal{E}^2_{min}$ as $\tau$ is varied.* Thus, 
a gradient search for the optimum $\tau$ is feasible. More precisely, we 
have shown this for the quadratic Nyquist I criterion (5), and have 
assumed that the search is slow in the sense that the equalizer "settles 
down" before the next step in $\tau$ is taken. 
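
The location of the minimum can be illustrated with a brute-force search. The sketch below is not part of the analysis; it assumes the linear roll-off pulse used earlier, for which the bracket in (27) scaled by $4/T^2$ becomes $(1 - \mu^2) \cos^2 \varphi_e + \mu^2$, a constant even phase $\theta$, and illustrative values of $\alpha$ and $\sigma^2$. The function names are hypothetical.

```python
import math


def foldover_term(tau, theta, alpha=0.1, sigma2=0.01, symbol_period=1.0, steps=400):
    """Second term S of the MSE, eq. (27), for a linear roll-off and constant theta."""
    phi_e = math.pi * tau / symbol_period + theta
    cos_sq = math.cos(phi_e) ** 2
    d_mu = 1.0 / steps
    total = 0.0
    for i in range(steps):
        mu = -1.0 + (i + 0.5) * d_mu
        # (4/T^2) [(R_e^2 - R_o^2) cos^2(phi_e) + R_o^2] with R_e = T/2, R_o = -(T/2) mu
        folded = (1.0 - mu * mu) * cos_sq + mu * mu
        total += d_mu / (sigma2 + folded)
    return alpha * sigma2 * total


def best_timing_phase(theta, grid=1000, symbol_period=1.0):
    """Grid search for the tau in [0, T) that minimizes the foldover term."""
    taus = [k * symbol_period / grid for k in range(grid)]
    return min(taus, key=lambda tau: foldover_term(tau, theta, symbol_period=symbol_period))


def predicted_timing_phase(theta, symbol_period=1.0):
    """Eq. (33a): (pi/T) tau + theta = 0, reduced modulo the period T."""
    return (-theta * symbol_period / math.pi) % symbol_period


if __name__ == "__main__":
    theta = 0.6  # arbitrary channel phase at the Nyquist frequency, in radians
    print(best_timing_phase(theta), predicted_timing_phase(theta))
```

Under these assumptions the grid minimum falls on the value predicted by (33a), and the point half a symbol away, corresponding to (33b), gives a larger S.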

Equation (33a) provides a mathematical description of the optimum 
timing phase $\tau$ in terms of the phase $\theta$ of the channel at the Nyquist 
frequency $\pi/T$. It is useful to give a more physical interpretation of 
this result. Consider the transmission of the special sequence 
$a_n = (-1)^n$. From (1), we receive the signal 

$$\sum_{n=-\infty}^{\infty} (-1)^n x(t - nT),$$

which has as its Fourier transform 

$$\begin{aligned}
X(\omega) \sum_{n} (-1)^n \exp(-i\omega nT)
&= X(\omega) \sum_{k} \exp(-i\omega 2kT) - X(\omega) \exp(-i\omega T) \sum_{k} \exp(-i\omega 2kT) \\
&= \frac{\pi}{T} X(\omega) \sum_{k} \delta\left( \omega - \frac{\pi k}{T} \right) - \frac{\pi}{T} X(\omega) \exp(-i\omega T) \sum_{k} \delta\left( \omega - \frac{\pi k}{T} \right) \\
&= \frac{2\pi}{T} X(\omega) \sum_{k\ \text{odd}} \delta\left( \omega - \frac{\pi k}{T} \right).
\end{aligned}$$

Hence, if $X(\omega)$ is band-limited to a bandwidth less than $3\pi/T$, and if 
we write $X(\pi/T) = A \exp(i\theta)$, the received signal is 

$$\frac{2}{T} A \cos\left( \frac{\pi t}{T} + \theta \right).$$

Comparing this with (32) and (33a), we see that the optimum timing 
phase is at the successive maxima and minima of the tone that results 
when a dotting sequence is transmitted. 

* See the appendix for a more general condition under which the existence of a 
unique minimum is guaranteed. 

Finally, we note that corrections for a small, varying, even 
component of phase distortion about the Nyquist frequency are easy to make. 
If we define $\gamma$ by 

$$\frac{\pi}{T} \tau + \theta = \gamma, \tag{34}$$

where, again, $\theta$ is the phase at the Nyquist frequency, and if we write 
[recalling (28)], 

$$\varphi_e(\mu) = \gamma + \epsilon(\mu), \qquad \epsilon(0) = 0,$$

where $\epsilon(\mu)$ is assumed small, then 

$$\gamma^* \approx - \frac{\displaystyle \int_{-1}^{0} \frac{(R_e^2 - R_o^2)\, \epsilon(\mu)}{[\sigma^2 + (4/T^2) R_e^2]^2}\, d\mu}{\displaystyle \int_{-1}^{0} \frac{(R_e^2 - R_o^2)}{[\sigma^2 + (4/T^2) R_e^2]^2}\, d\mu}, \tag{35}$$

where $\gamma^*$ is the approximate value of $\gamma$ for optimum timing phase $\tau$. 
We see $\gamma^*$ is bounded in magnitude by max $\epsilon(\mu)$, but in general it will 
be a fraction of this. Assuming max $\epsilon(\mu)$ corresponds to 9 degrees, this 
rough bound suggests that a correction of at most 5 percent of a 
symbol interval would be required. 

For a more refined estimate we take the following: 

$$R_e(\mu) = 1, \qquad R_o(\mu) = -\mu, \qquad \text{and} \qquad \epsilon(\mu) = \epsilon\mu^2. \tag{36}$$

Note that $R_e^2 - R_o^2 = (R_e - R_o)(R_e + R_o) = R(\mu) R(-\mu)$. Since R(1) 
will vanish by the design of the signal, $R_e^2 - R_o^2$ will be zero at $\mu = -1$ 
and, thus, no multiplicative constant is included in $R_o(\mu)$. Using (36) 
in (35) yields 

$$\gamma^* \approx -\epsilon\, \frac{\displaystyle \int_{-1}^{0} (1 - \mu^2) \mu^2\, d\mu}{\displaystyle \int_{-1}^{0} (1 - \mu^2)\, d\mu} = -\frac{\epsilon}{5}. \tag{37}$$

Thus an estimate of 1/5 of the even part of the phase variation over the 
roll-off band results for the correction term to (34). This would appear 
to be a negligible effect. 
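
The ratio of integrals behind this estimate can be evaluated exactly. The sketch below assumes the reading $R_e = 1$, $R_o^2 = \mu^2$, and $\epsilon(\mu) = \epsilon\mu^2$ for (36); with $R_e$ constant, the noise-dependent denominator in (35) is the same in both integrals and cancels. The helper names are hypothetical.

```python
from fractions import Fraction


def integrate_polynomial(coefficients, lower, upper):
    """Exact integral over [lower, upper] of sum(c_k * mu**k), c_k = coefficients[k]."""
    total = Fraction(0)
    for power, coeff in enumerate(coefficients):
        antiderivative_span = Fraction(upper) ** (power + 1) - Fraction(lower) ** (power + 1)
        total += Fraction(coeff) * antiderivative_span / (power + 1)
    return total


def correction_fraction():
    """Ratio of the two integrals in (37) under the assumptions stated above."""
    numerator = integrate_polynomial([0, 0, 1, 0, -1], -1, 0)   # (1 - mu^2) * mu^2
    denominator = integrate_polynomial([1, 0, -1], -1, 0)       # (1 - mu^2)
    return numerator / denominator


if __name__ == "__main__":
    print(correction_fraction())
```

The numerator is 2/15 and the denominator 2/3, so the correction is one fifth of the peak even-phase deviation $\epsilon$.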


III. DISCUSSION FOR PASSBAND TRANSMISSION 


We now generalize the foregoing discussion to treat a QAM passband 
signal format. We assume a transmitted signal 

$$\left[ \sum_n a_n p(t - nT) \right] \cos \omega_c t - \left[ \sum_n b_n p(t - nT) \right] \sin \omega_c t, \tag{38}$$

where $\{a_n\}$ and $\{b_n\}$ are independent digits. Let the impulse response 
of the linear distortion medium m(t) be written as 

$$m(t) = 2F_1(t) \cos \omega_c t - 2F_2(t) \sin \omega_c t. \tag{39}$$

Further, denote the convolution of p(t) with $F_1(t)$ and $F_2(t)$ by x(t) 
and y(t), i.e., 

$$p(t) * F_1(t) = x(t), \qquad p(t) * F_2(t) = y(t). \tag{40}$$

Then the received signal is 

$$\begin{aligned}
r(t) = {} & \cos \omega_c t \left[ \sum_n a_n x(t - nT) - \sum_n b_n y(t - nT) \right] \\
& - \sin \omega_c t \left[ \sum_n a_n y(t - nT) + \sum_n b_n x(t - nT) \right] + n(t),
\end{aligned} \tag{41}$$

where n(t) represents additive gaussian noise. The waveform r(t) is then 
linearly filtered by a passband equalizer, whose output can be 
described by replacing x(t) by g(t) and y(t) by h(t), where 


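$$\begin{aligned}
g(t) &= \sum_{n=-N}^{N} c_n x(t - nT) - \sum_{n=-N}^{N} d_n y(t - nT) \\
h(t) &= \sum_{n=-N}^{N} d_n x(t - nT) + \sum_{n=-N}^{N} c_n y(t - nT).
\end{aligned} \tag{42}$$

Further, if the input noise n(t) has correlation R(t), then the output 
noise has variance 

$$\sigma_0^2 = c^\dagger R c + d^\dagger R d, \tag{43}$$

where $R_{ij} = R[(i - j)T]$. We adopt as our criterion 

$$\mathcal{E}^2 = {\sum_n}' g_n^2 + (g_0 - 1)^2 + \sum_n h_n^2 + \sigma_0^2 \tag{44}$$

and seek to minimize this quantity. Analogously with the baseband 
case we define two baseband autocorrelation functions [corresponding 
to x(t) and y(t) in (41)] by 

$$A_{nm} = A_{n-m} = \sum_{j=-\infty}^{\infty} x_{j-n} x_{j-m} \qquad \text{and} \qquad B_{nm} = B_{n-m} = \sum_{j=-\infty}^{\infty} y_{j-n} y_{j-m}. \tag{45}$$

Further, we define a cross-correlation matrix between x(t) and y(t), namely, 

$$K_{nm} = K_{n-m} = -K_{mn} = \sum_{j} (x_{j-n} y_{j-m} - x_{j-m} y_{j-n}). \tag{46}$$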

If, as before, we write $(x)_n = x_{-n}$, $(y)_n = y_{-n}$, then 

$$\mathcal{E}^2 = c^\dagger (A + B) c + d^\dagger (A + B) d - 2 c^\dagger K d - 2 c \cdot x + 2 d \cdot y + 1 + c^\dagger R c + d^\dagger R d. \tag{47}$$

If one defines the matrix Q = A + B + R, the equations for the 
optimum taps are 

$$\sum_m (Q_{n-m} c_m - K_{n-m} d_m) = x_{-n}, \qquad \sum_m (Q_{n-m} d_m + K_{n-m} c_m) = -y_{-n}, \tag{48}$$

and 

$$\mathcal{E}^2_{min} = 1 - (g_0)_{opt} = 1 - c_{opt} \cdot x + d_{opt} \cdot y. \tag{49}$$
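We introduce the same transform as before, noting 

$$Q(\omega) = \frac{|X_{eq}(\omega)|^2 + |Y_{eq}(\omega)|^2}{T} + R(\omega) = Q(-\omega) \tag{50}$$

and 

$$K(\omega) = \frac{1}{T} [X_{eq}^*(\omega) Y_{eq}(\omega) - X_{eq}(\omega) Y_{eq}^*(\omega)] = -K(-\omega). \tag{51}$$

Straightforward solution of (48) and (49) yields 

$$\begin{aligned}
\mathcal{E}^2_{min} &= \frac{T}{2\pi} \int_{-\pi/T}^{\pi/T} \frac{Q(\omega) R(\omega)}{Q^2(\omega) + K^2(\omega)}\, d\omega \\
&= \frac{T}{2\pi} \int_{-\pi/T}^{\pi/T} \frac{[Q(\omega) - iK(\omega)] R(\omega)}{Q^2(\omega) + K^2(\omega)}\, d\omega \\
&= \frac{T}{2\pi} \int_{-\pi/T}^{\pi/T} \frac{R(\omega)}{Q(\omega) + iK(\omega)}\, d\omega,
\end{aligned} \tag{52}$$

where the oddness of $K(\omega)$ [and evenness of $Q(\omega)$ and $R(\omega)$] has been 
used. If we formally introduce $S(\omega)$ by 

$$S(\omega) = X(\omega) + iY(\omega), \tag{53}$$

and 

$$S_{eq}(\omega) = X_{eq}(\omega) + iY_{eq}(\omega), \tag{54}$$

we see that 

$$\begin{aligned}
|S_{eq}(\omega)|^2 &= [X_{eq}(\omega) + iY_{eq}(\omega)][X_{eq}^*(\omega) - iY_{eq}^*(\omega)] \\
&= |X_{eq}(\omega)|^2 + |Y_{eq}(\omega)|^2 + iY_{eq}(\omega) X_{eq}^*(\omega) - iX_{eq}(\omega) Y_{eq}^*(\omega) \\
&= T[A(\omega) + B(\omega) + iK(\omega)],
\end{aligned} \tag{55}$$

or, briefly, 

$$Q + iK = \frac{|S_{eq}|^2}{T} + R. \tag{56}$$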

Thus, 

$$\mathcal{E}^2_{min} = \frac{T}{2\pi} \int_{-\pi/T}^{\pi/T} \frac{R(\omega)}{\frac{1}{T} |S_{eq}(\omega)|^2 + R(\omega)}\, d\omega. \tag{57}$$

This formula has a close analogy to the baseband formula when we 
realize that the overall channel (transmitting and distorting filter) 
has frequency characteristic 

$$C(\omega) = \tfrac{1}{2} S(\omega - \omega_c) + \tfrac{1}{2} S^*(-\omega - \omega_c). \tag{58}$$


Thus, in terms of the overall channel, (57) has a straightforward 
interpretation. One uses a formula similar to the baseband version (17) 
except that now one forms an equivalent Nyquist characteristic using 
twice the positive frequency part of the passband channel and treats 
the carrier frequency as zero. It is essential to note that the transfer 
characteristic from which $S_{\mathrm{eq}}(\omega)$ is formed no longer needs to have 
even amplitude and odd phase about "zero" as would be required in 
the baseband case. In particular, the envelope delays at the two band 
edges can have an effect on timing phase. In practical cases, the carrier 
frequency can be chosen to give equal envelope delays, thus minimizing 
the effects of odd phase about the band edges. However, in doing this, 
the amplitude effects may have worsened the situation. The main 
point to be emphasized here is the interplay of carrier placement and 
bit timing with an excess bandwidth system. 

Regarding the description of optimum timing phase, let $\varphi^{\pm}$ be the 
channel phase at $\omega = \omega_c \pm \pi/T$, and assume these phases to be 
constant over their respective roll-off regions. Then, for any reasonable 
amplitude distortions, the unique best sampling instant $\tau$ is given by 


T ra glee =e) =, (59) 


Under these conditions, we again have that there exists but one local 
minimum. 
APPENDIX 
Uniqueness of Local Minimum of Mean-Square Error for Small $\varphi(x)$ 


The interesting portion of the mean-square error is given by (27), 
which is of the form 

$$g(\gamma) = \int_0^1 \frac{dx}{a(x) + b(x)\cos^2[\gamma + \varphi(x)]} \tag{60}$$

with a(x) and b(x) positive. If $\varphi(x)$ is always small, then any local 
minimum of (60) has to occur when $\gamma$ is small. Thus, if $|\varphi(x)| < \pi/8$, 
we see that the derivative 

$$g'(\gamma) = \int_0^1 \frac{b(x)\,\sin[2\gamma + 2\varphi(x)]}{\{a(x) + b(x)\cos^2[\gamma + \varphi(x)]\}^2}\,dx \tag{61}$$

can only vanish if $|\gamma| < \pi/8$ or $|\pi/2 - \gamma| < \pi/8$, for otherwise the 
integrand always has the same sign. Let us investigate the solution 
in $|\gamma| < \pi/8$ in more detail. Taking another derivative gives 


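$$g''(\gamma) = 2\int_0^1 \frac{b\,\cos(2\gamma + 2\varphi)}{[\,a + b\cos^2(\gamma + \varphi)\,]^2}\,dx + 2\int_0^1 \frac{b^2\,\sin^2(2\gamma + 2\varphi)}{[\,a + b\cos^2(\gamma + \varphi)\,]^3}\,dx, \tag{62}$$

which is positive, since $|2\gamma + 2\varphi| < \pi/2$ makes the cosine in the first 
term positive. Hence, there can be only one solution of $g'(\gamma) = 0$ 
for $|\gamma| < \pi/8$ (we cannot have two minima without a maximum in 
between). Likewise, if $|\varphi(x)| < \pi/8$, there is precisely one maximum 
of $g(\gamma)$, and it occurs in the range $|\pi/2 - \gamma| < \pi/8$. 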
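
The uniqueness argument can be spot-checked numerically. The sketch below is only an illustration under stated assumptions: it takes g to be the integral over 0 ≤ x ≤ 1 of 1/(a(x) + b(x) cos²[γ + φ(x)]), picks hypothetical positive a(x), b(x) and a phase φ(x) bounded by 0.3 < π/8, scans one period of g (which repeats every π in γ), and counts the strict local minima on the grid. The function names and grid sizes are likewise assumptions.

```python
import math


def g(gamma, a, b, phi, steps=400):
    """Midpoint-rule estimate of the integral over 0 <= x <= 1 of
    1 / (a(x) + b(x) * cos^2(gamma + phi(x)))."""
    h = 1.0 / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * h
        total += h / (a(x) + b(x) * math.cos(gamma + phi(x)) ** 2)
    return total


def local_minima(a, b, phi, grid=720):
    """Scan one period of g (length pi) and return the grid points that are
    strict local minima, treating the grid as circular."""
    gammas = [-math.pi / 2 + k * math.pi / grid for k in range(grid)]
    values = [g(gm, a, b, phi) for gm in gammas]
    return [gammas[k] for k in range(grid)
            if values[k] < values[k - 1] and values[k] < values[(k + 1) % grid]]


# Hypothetical test functions: a, b positive and |phi| <= 0.3 < pi/8.
a = lambda x: 1.0 + x
b = lambda x: 2.0 + math.cos(2 * math.pi * x)
phi = lambda x: 0.3 * (2 * x - 1)

minima = local_minima(a, b, phi)
```

For these inputs the scan should return a single grid point, lying inside |γ| < π/8, in line with the argument above.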
Erratum 


“Delay Distortion in Generalized Lens-Like Media,” B.S.T.J., 
Vol. 53, No. 2, February 1974, pp. 177-198, by S. E. Miller. 
Equation (35) on page 183 should read as follows: 


_ m A(u — 2) 1 — (ut+2)\} 
Bri, a ee (arp (Qu + ) 8) 